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Diversity challenge 


There is growing evidence that embracing diversity — in allits senses — is key to doing good 
science. But there is still work to be done to ensure that inclusivity is the default, not the exception. 


College London, wrote about the prejudice he experiences as a 

gay scientist. Intolerant peers jump to conclusions, insult him and 
make assumptions about his beliefs and behaviour. It is better, Welton 
wrote, to hide behind a lie: “I often find it easier to say ‘I'm a teacher.” 

Scientist colleagues had no problem with him being gay. But he found 
that people in the lesbian, gay, bisexual and transgender community 
seemed to have a problem with him being a scientist. “Most scientists, 
medics and engineers know that unless they have a stethoscope around 
their neck they aren't valued,’ he wrote. 

As we explore in a Feature on page 297, others may have a different 
experience. Scientists, of course, should not be judged by their sexual- 
ity. The principles of research — reliance on data, rigorous experimen- 
tation and respect for evidence — do not cluster by any of the ways 
that humans choose to define themselves and each other. Gender, race, 
ethnic background, social status, wealth, nationality, age, skin colour 
and sexuality are as irrelevant to doing science as a person's musical 
taste or dietary preference. Or are they? 

There is no place in science (or outside it) for prejudice. But there 
must be a place for diversity, and there is growing evidence that such 
variety is a key ingredient for doing good science. Much of that evi- 
dence is discussed this week in a joint special issue of Nature and our 
sister publication Scientific American. 

Diversity is a vague word. The special-issue content (available at 
nature.com/diversity) is wide-ranging and covers much ground. It 
can be usefully tied together by a working definition: diversity means 
an inclusive approach, both to the science itself and the make-up of 
the groups of people who carry out the research. 

Diversity is a topic too often discussed in the negative, through 
stories of discrimination and bias against select communities. Science 
has its problems here just like most of society, and Nature has long 
spoken out, for example, against the under-representation of women. 
Much of the special-issue content frames the subject in a different way, 
and examines the benefits of an inclusive approach. 

Attention, busy scientists: if diversity sounds like a worthy topic 
but one better left to your university's human-resources department, 
then turn to page 305, where Richard Freeman and Wei Huang explain 
how it might boost your citation rate. Their analysis of the surnames 
of US-based authors on some 2.5 million research papers suggests 
that scientists who tend to stick with their own kind publish less-cited 
work, and in lower-impact journals. 

Why published collaborations with a greater mixture of surnames 
perform better is unknown. What is clearer is that a mixture of people 
(mixed across what- 
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possible solutions to a problem. If the problem is scientific, then the 
result of that diversity can be better science. On page 301, for example, 
Esteban Burchard describes how his ethnic background and experi- 
ences with a variety of cultures have helped him to study the genetics 
of asthma in Latino Americans. On page 304, Monica Ruiz-Casares 
highlights how the results of mental-health research based on adult, 
2 Western populations might not apply to other 
“There is aplace cultures Hai soneaniiaes ae 
for positive But collaboration that spans vast personal 
discrimination to differences can raise problems. On page 303, 
address specific — Wenzel Geissler and Ferdinand Okwaro dis- 
imbalances.” cuss the sometimes-fraught scenarios that arise 
when researchers from very different economic 
backgrounds work closely together. To draw attention to this inequality 
can be awkward, say the duo, but that is better than the destructive ways 
it can surface ifignored. 

Science has already been through one revolution in diversity. 
Traditional academic silos that held subjects as distinct disciplines have 
crumbled. Interdisciplinary research now sets the agenda in many fields, 
especially those with a direct impact on society, such as climate-change 
research. That shift, although beneficial, was not entirely spontaneous. 
It was managed and encouraged by senior scientists and funders, who 
saw the pay-off. To fully develop the benefits of diversity, to ensure that 
science becomes fully inclusive, similar intervention is necessary — even 
if it is as simple as a busy lab head stopping to consider the issue for the 
first time. 

Asa telling graphic in the special issue of Scientific American illus- 
trates, 51% of the science and engineering workforce in the United 
States is white and male. There is a place for positive discrimination to 
address specific imbalances. But diversity is not just a case of champion- 
ing minority interests — the benefits of diversity go to the majority. m 


A worthy ambition 


Finalizing the European Research Areais still 
avibrant and relevant goal. 


“gradual process’, admits the European Commission rather 

forlornly, at the conclusion of a report it published earlier this 

week on progress towards an entity within which European research- 
ers and their ideas can circulate freely. 

The European Research Area (ERA) was originally due to be final- 

ized by the end of this year. The notion that this could happen, set 
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in train only two years ago by Maire Geoghegan-Quinn — who will 
depart as research commissioner of the European Union (EU) this 
autumn — was always as fanciful as it was beside the point. 

That is because the ERA is a process, not an event. The project 
will never end. Anyone who imagines that it might do so only has 
to look at the United States. There, despite a genuinely single market 
and decades of federal incentives, huge disparities persist in ‘research 
excellence’ — however it is measured — between, say, Massachusetts 
and Montana. 

That the problem is difficult does not mean that it should not be 
addressed. Optimists will note the remarkable progress that has been 
made in European research collaboration over the past 50 years and, 
in particular, over the past 15. Huge EU research programmes have 
forged active collaboration involving tens of thousands of scientists. 
Academic mobility between nation states is visibly increasing, every- 
where you look. 

Almost all major facilities are now planned on the basis of pan- 
European collaboration. Earlier this month, ground was broken on the 
latest of these: the European Spallation Source near Lund in Sweden, 
paid for by 17 European nations. (It is worth noting that the United 
States has not managed to start work on a billion-dollar-scale research 
facility for more than a decade.) 

Most importantly, a cohesiveness and mutual understanding has 
emerged between senior European scientists that most parts of the 
world can only look upon with envy. Compared with the situation in 
east Asia in particular, the level of everyday dialogue and collabora- 
tion that exists in Europe in several major disciplines, such as particle 
physics and molecular biology, is singularly impressive. 

This process had been going on for decades, before the formal 
concept of the ERA was endorsed by EU heads of state at a summit 
meeting in Lisbon in 2000. 

The idea of taking specific administrative steps to improve 
researcher mobility was mainly theoretical at first, but has steadily 
gained impetus. And the decision was taken in 2012 for the European 
Commission to report annually on ERA progress, with the aim of 
cajoling more action out of member states. 

At the same time, the political context for the ERA initiative has 


changed for the worse. The ERA was conceived when the EU had just 
experienced a period of rapid convergence — in particular, economic 
convergence between living standards in the north and south. 

Since 2008, however, the health of Europe’s national economies has 
been diverging. Today, in research and innovation, as in other spheres, 
the wealthier regions are moving rapidly ahead, with the poorer ones 
falling behind. On the face of it, this makes 
the ERA’ objectives more elusive than ever. 
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portability of researchers’ pensions and to 
address the gender gap in research. Women 
now obtain around half of Europe's PhDs, but will receive less than a 
quarter of this year’s grants from the prestigious European Research 
Council. This is a major problem that both universities and research 
agencies prefer to overlook; its vigorous pursuit is a worthwhile goal 
for the commission. 

Another change that has intruded on the ERA since 2000 is the 
accelerated emergence of a de facto global research area among elite 
researchers in most disciplines. Since 2000, with the rapid growth of 
the Internet, genuine global research collaboration has become almost 
routine, rendering ‘local’ collaboration less significant. 

Still, Geoghegan-Quinn’s successor as research commissioner — the 
current nominee is Portugal’s Carlos Moedas — should pursue the 
goals of the ERA with as much vigour as possible. There will doubt- 
less be renewed debate in the new European Parliament about the 
need for a fresh EU directive to force member states’ hands over the 
ERA. In the meantime, it is up to the member states and their institu- 
tions to do more. 

The 2014 deadline may be about to pass, but the project must 
endure. Ultimately, its fate rests in the hands of every department, 
institution and research agency in Europe — to build the ERA, one 
step ata time. m= 


Amped-up plants 


Bacterial enzyme supercharges photosynthesis, 
promising increased yields for crops. 


and oxygen is arguably the most important chemical reaction 

in the world, and one of the oldest. It is so old, in fact, that 
it evolved when the world’s atmosphere was much lower in oxygen 
than it is today. So, in a way, photosynthesis is its own worst enemy. 
Thousands of millions of years later, most modern plants struggle to 
photosynthesize because of all the darned oxygen in the air — oxygen 
that they helped to put there. These plants simply cannot distinguish 
between molecules of carbon dioxide and molecules of oxygen, so they 
waste their time and energy grabbing both. 

Some plants can do better — for example, plenty of weeds (ever won- 
dered why they grow so fast?) have evolved ways to concentrate carbon 
dioxide inside their leaves, to supercharge their photosynthesis. Cyano- 
bacteria can do this too. But the majority of plants, including most of 
the crops we rely on for food, have developed a blunter strategy: pro- 
duce lots and lots of the enzyme that drives the reaction. That enzyme, 
Rubisco, is thus among the most abundant proteins on the planet. 

A significant amount of Rubisco still wastes its time grabbing useless 
oxygen — reducing the overall efficiency of global photosynthesis by 
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almost one-third. When they discuss ways to boost the world’s food 
supplies, plenty of plant scientists see leaves’ wasted photosynthesis 
capacity as, well, low-hanging fruit. 

What if crops could borrow the faster-acting Rubisco system of 
weeds and cyanobacteria? In theory, this would dramatically boost 
their growth rate and so their yield, all without needing any extra 
farmland. The appeal of such a strategy is obvious, particularly in 
the face of the often-quoted United Nations demand for global food 
production to double by 2050. 

In practice, replacing the enzyme has proved difficult. But there is 
encouraging news: on Nature’s website, researchers report that they 
have made tobacco plants that use the Rubisco from a cyanobacterium 
(M. T. Lin et al. Nature http://dx.doi.org/10.1038/nature13776; 2014). 
Sure enough, the transformed plants photosynthesize faster and have 
higher rates of CO, turnover than their conventional counterparts. 
Faster-growing tobacco plants might not sound like a boon for global 
welfare, but they do demonstrate what might be possible in future. 
(Tobacco is a common model organism for genetic-engineering 
research.) 

As biologists Dean Price and Susan Howitt write in an accompany- 
ing News & Views (G. D. Price and S. M. Howitt Nature http://dx.doi. 
org/10.1038/nature13749; 2014): “The workis a milestone on the road 
to boosting plant efficiency. The advance can be 
likened to having a new engine block in place in 
a high-performance car engine — now we just 
need the turbocharger fitted and tuned” Avail- 
able in any colour you like, as long as it’s green. m 
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pass their deadline next year and will be replaced by the broader 
and more ambitious Sustainable Development Goals (SDGs) 
to guide world development until 2030. 

The SDGs matter because they will set development priorities for 
governments and businesses, among others. Moreover, they can help 
to reshape attitudes towards the relationship between economic growth 
and environmental protection, to help preserve and protect both. 

Draft goals were presented to the UN General Assembly last week in 
New York. A year of negotiations follows, with the final version of the 
goals scheduled to be affirmed in September 2015. That the world is 
close to agreeing on a consolidated set of objectives for global sustain- 
ability is a game-changer. 

However, it is crucial that the new goals are 
based on the best scientific evidence of envi- 
ronmental problems and the best strategies to 
mitigate these risks. Scientists have helped to 
draft the proposed goals, but their input has been 
weak, fragmented and intermittent. We have less 
than 12 months to change that. 

The first problem is that there are too many 
proposals: 17 goals encompassing 169 individual 
targets, ranging from improving maternal health 
to safeguarding the oceans. The strategy has 
shifted from a list of priorities to an unwieldy and 
impractical catch-all. The strength of the original 
MDGs was their focus. 

We should aim for no more than ten goals, 
with around five or six targets for each. This 
should offer the right balance between covering enough ground and 
providing sharp focus. These ten goals should cover social, economic 
and environmental priorities, and on these points the draft propos- 
als make a good start. Four draft goals discuss global environmental 
constraints, for climate, water, ecosystems and the oceans. This is a 
step forward that should be applauded. 

Although many of the proposed social targets are ambitious, aspira- 
tional and reasonably well defined, the biophysical targets are vague, 
modest and lack detailed quantification. For example, under the health 
goal, the first target is specific: “By 2030 reduce the global maternal 
mortality ratio to less than 70 per 100,000 live births.” By contrast, 
the sustainability target under the food-security goal starts: “By 2030 
ensure sustainable food production systems”. The target is nebulous 
and, crucially, omits mention of important constraints on the nitro- 
gen, phosphorus and water cycles. A water target is equally vague: “By 
2030, substantially increase water-use efficiency 
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UN sustainability goals 
need quantified targets 


Scientists must step up and secure meaningful objectives if they are to 
protect both people and planet, says Mark Stafford-Smith. 


in energy, resource and land-use systems. Without quantified targets 
and monitoring, it is impossible to determine whether sufficient 
progress is being made. 

We already know enough about the biophysical systems involved 
to set specific targets, such as keeping the flow of phosphorus into the 
ocean to below 11 million tonnes per year. 

Perhaps most importantly, the goals must work towards a common 
purpose. At present, individual goals on energy access and tackling cli- 
mate change could contradict each other — massive expansion of fossil- 
fuel use, for example, would satisfy one goal but undermine the other. To 
prevent this, the goals must be integrated. There are perceived trade-offs 
between securing the long-term stability and health of the Earth system, 
and securing water, food and energy security in 
the short term. But this need not be the case. An 
integrated approach to food security could also 
ensure that sustainability targets for nutrient and 
water cycles are met. For example, we should aim, 
by 2030, to use no more than 1,000 cubic metres 
of water per tonne of key food crops produced. 

Ina similar way, the current potential conflicts 
between the goals of delivering energy for all and 
limiting greenhouse-gas emissions can be medi- 
ated by strong integrative targets: decrease carbon 
intensity by increasing the share of renewable 
energy to 30%, and increase energy intensity by 
2.4% per year. Current targets do address these 
two issues, but without quantification. 

These are realistic and achievable changes. But 
the research community must convince policy- 
makers that such changes are important. Organizations such as the 
Future Earth initiative, the UN’s Sustainable Development Solutions 
Network and the UN Secretary-General’s Scientific Advisory Board 
must ensure that the right expertise is brought to bear on this challenge 
at international and regional levels. 

Ata national level, funding agencies and scientific academies need 
to bring together expertise to support this international process. 
Scientists should identify and talk to the negotiators who will finalize 
the draft goals. 

2015 is a significant year for international politics related to global 
change. Nations will also agree on a new climate deal and a strategy for 
disaster-risk reduction. Traditionally, science has struggled to respond 
flexibly to the demands and speed of some political processes. But 
the SDGs are too important for the research community to let the 
opportunity pass. 


Mark Stafford-Smith is chair of the Science Committee of Future 
Earth, and principal research scientist with the Commonwealth 
Scientific and Industrial Research Organisation in Canberra, Australia. 
e-mail: mark. staffordsmith@csiro.au 
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Wide area of Ebola 
risk in Africa 


The region in Africa at risk 
of an outbreak of the Ebola 
virus is larger than previously 
thought. 

Simon Hay at the University 
of Oxford, UK, and his team 
mapped data from 23 Ebola 
outbreaks in humans, 
including the current one, and 
51 reports of Ebola in other 
animals. They combined the 
data with information on 
human mobility and the range 
of animal hosts suspected of 
carrying the virus, such as Old 
World fruit bats. 

The team found that the 
potential reservoir for the 
virus spans 22 countries 
in western and central 
Africa, and includes an 
area containing more than 
15 million people, where Ebola 
cases have already occurred. 

This finding, combined 
with other recent trends such 
as increasing urbanization, 
may account for an apparent 
increase in the frequency and 
size of outbreaks since 2000. 
eLife http://doi.org/vms (2014) 


How a shark used 
its saw-like jaw 
Despite having a set of teeth 
shaped like a circular saw, 


an extinct shark probably 
devoured only soft-bodied prey. 


RCH HIGHLIGHTS 


Selections from the 
scientific literature 


ECOLOGY 


Bird diversity at risk from farming 


Birds that have the longest evolutionary 
history are also the most threatened by 


agriculture. 


Luke Frishkoff at Stanford University in 
California, Daniel Karp at the University of 
California, Berkeley, and their team studied 
12 years of bird survey data, covering nearly 
500 species from three types of land use in 
Costa Rica: forests, diversified agriculture and 
intensive farming of just a few crop species. 
They found that on farmland, evolutionarily 


The spiral-shaped tooth 
arrangement (pictured) of 
Helicoprion davisii, an animal 
some 300 million years old, has 
puzzled palaeontologists for 
more than a century. Last year, 
a team determined that the 
teeth were surrounded by the 
shark’s lower jaw. 

Ina follow-up study, Jason 
Ramsay at the University of 
Rhode Island in Kingston 
and his team used computed 
tomography scans of the 

fossils to reconstruct the 
jaw muscles and model 
the mechanics of the jaw 
and teeth to determine 
how and what the animal 
ate. They concluded that 
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distinct birds, which are related to few other 
living species — such as the rufous-tailed 


jacamar (Galbula ruficauda; pictured) — went 


older teeth at the front of the 
jaw snagged prey whereas 
younger, stronger teeth deeper 
in finished them off. 

The shark teeth were rarely 
worn or broken, suggesting 
the animals ate soft-bodied sea 
creatures such as cephalopods. 
J. Morphol. http://dx.doi. 
org/10.1002/jmore.20319 (2014) 


Surface heat led to 
ice-shelf demise 


The collapse of Antarctica’s 
giant Larsen B Ice Shelfin 
2002 was probably caused 
by warming at the surface 


extinct locally at higher rates than those that 
had evolved more recently. 

However, less-intensive agriculture fostered 
greater levels of phylogenetic diversity than 
intensive farming, so the authors suggest that 
this type of agriculture could help to conserve 
some bird evolutionary history. 

Science 345, 1343-1346 (2014) 


rather than by instability at the 
bottom of the ice sheet. 

Eugene Domack at the 
University of South Florida in 
St Petersburg and his colleagues 
mapped the sea floor below 
where the shelf used to be. 
They also analysed marine 
sediment cores to reconstruct 
characteristics of the ice shelf’s 
grounding zone — where 
the floating ice shelf meets 
underlying bedrock — before 
the ice collapsed. 

They found that this zone 
had remained stationary for 
some 12,000 years, challenging 
the idea that structural changes 
at the bottom of the ice shelf 
might have caused Larsen B's 
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disintegration. 

The findings could 
inform estimates of how 
much Antarctic melting will 
contribute to future sea level 
rise, the authors say. 
Science 345, 1354-1358 (2014) 


IMMUNOLOGY 


The gut improves 
vaccine effects 


Bacterial residents of the gut 
boost immune responses to 
vaccination in mice. 

Humans vaccinated against 
the influenza virus ramp up 
expression ofa protein called 
TLRS, which is involved in 
detecting certain types of 
bacterium. To see how this 
protein and gut bacteria might 
affect immune responses to 
vaccines, Bali Pulendran of 
Emory University in Atlanta, 
Georgia, and his colleagues 
studied mice that lack the gene 
encoding TLRS. 

They found that the animals 
produced fewer antibodies in 
response to flu vaccination than 
normal mice. The team saw 
similar effects in mice reared in 
a germ-free environment and 
in those treated with powerful 
antibiotics. Antibody responses 
could be restored, however, by 
inoculating the mice with the 
kind of bacteria to which TLR5 
is sensitive. 

The results suggest that 
antibiotic treatment could 
hinder the effects of certain 
vaccines, the authors say. 
Immunity http://doi.org/vm3 
(2014) 


PARTICLE PHYSICS 


Better estimate of 
Higgs mass 


Researchers have decreased the 
uncertainty of their estimate of 
the mass of the Higgs boson, 
the particle thought to bestow 
mass to matter. 

The ATLAS collaboration, 
one of two teams that 
detected the Higgs at the 
Large Hadron Collider 
near Geneva, Switzerland, 
reanalysed data and improved 
detector calibration to come 
up with the revised mass of 


125.36 gigaelectronvolts (GeV), 
with a systematic uncertainty of 
0.18 GeV — an improvement 
bya factor of three. 

The measurement will 
refine predictions of the 
Higgs’ behaviour and help to 
identify potential phenomena 
not predicted by the standard 
model of physics, the team says. 
Phys. Rev. D 90, 052004 (2014) 


INFECTIOUS DISEASE 


Mosquitoes 
awaken malaria 


Mosquitoes biting a malaria- 
carrying host coax the malaria 
parasite to come out of hiding, 
resulting in greater disease 
transmission. 

Sylvain Gandon at the 
National Centre of Scientific 
Research in Montpellier, 
France, and his colleagues 
infected canaries (Serinus 
canaria) with a malaria 
parasite that is specific to 
birds (Plasmodium relictum), 
and then exposed them to 
mosquitoes that were not 
carrying the parasite. 

After the birds were bitten 
by malaria-free insects, the 
level of parasites rose in the 
birds’ blood. Mosquitoes that 
subsequently bit birds were 
more likely to pick up and 
transmit the parasite than 
insects attacking birds that 
had not been initially bitten. 

The researchers conclude 
that mosquito bites trigger 
Plasmodium to emerge from 
its dormant stage. 

PLoS Pathog. 10,e1004308 (2014) 


MICROBIOLOGY 


Vaginal microbe 
makes drug 


A bacterium that lives in the 
human vagina produces an 
antibiotic, suggesting how the 
microbiome could be mined 
for possible drug candidates. 
Michael Fischbach at the 
University of California, San 
Francisco, and his colleagues 
trained a computer program 
to recognize genes that are 
known to make molecules that 
could be used as drugs, and 
then asked the program to hunt 


RESEARCH HIGHLIGHTS 


THIS WEEK 


Popular articles 
on social media 


SOCIAL SELECTIO 


High retraction rates raise eyebrows 


Amid a wave of recent retractions, researchers are taking to 
social media to discuss a perennial favourite: a three-year-old 
paper looking at the relationship between a journal’s impact 
factor and its retraction frequency. The 2011 report proposed 
a “retraction index’, a measure of the likelihood that a paper 
in a given journal will eventually be pulled from the literature. 
The authors looked at articles published from 2001 to 2010 

in 17 journals and plotted the journals’ retraction indexes 
against their impact factor. The result was clear: the higher 
the impact factor, the higher the retraction index. “You know 
‘high impact’ journals? All that means is that work is more 
likely to be retracted,” tweeted Jon Tennant, who studies 
palaeontology at Imperial College London, earlier this month. 
David Basanta, a cancer researcher at the Moffitt Cancer 
Center in Tampa, Florida, responded on Twitter: “A case 
could be made that more people try to replicate the results.” 
Infect. Immun. 79, 3855-3859 (2011) 


Based on data from altmetric.com. 
Altmetric is supported by Macmillan 
Science and Education, which owns 
Nature Publishing Group. 


for similar genes in the human 
microbiome. 

This yielded thousands of 
genes, including some that 
make a class of antibiotics 
called thiopeptides. The team 
isolated a new thiopeptide 
from a vaginal microbe grown 
in the lab, and found that the 
compound could kill the same 
types of bacterium as other 
thiopeptides. 

This could be the first drug 
discovered in and isolated from 
an organism living in humans, 
the authors say. 

Cell 158, 1402-1414 (2014) 


ECOLOGY 


Sneaky ants steal 
in plain sight 


A recently discovered parasitic 
ant species steals food from 
colonies of another ant by 
disguising itself as the host. 
Scott Powell at George 
Washington University in 
Washington DC and his 
co-workers discovered the 
parasitic ant, Cephalotes 
specularis (pictured right), 
in the Brazilian woodland 
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savannah. C. specularis lives 
only with its host, the highly 
aggressive Crematogaster 
ampla ant (pictured left). 

The researchers found that, 
rather than introduce its brood 
into the host’s nest like other 
parasitic ants, C. specularis 
mimics the body posture of the 
host worker ants to move freely 
around the host's territory. 

The deceptive ant follows the 
host’s pheromone trails to 
locate food, and manages to 
sneak undetected into 89% of 
potential host territories. 

Am. Nat. http://dx.doi. 
org/10.1086/677927 (2014) 
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SEVEN DAYS nscesinn 


Science partnership 
The United Kingdom and South 


Africa signed a multi-year 
science and technology 
partnership on 9 September, 
to be jointly funded with 

£7.8 million (US$12.7 million) 
annually. Research priorities 
include public health, food 
security and technology 
development. The money will 
be administered by the Newton 
Fund, established by Britain to 
support science collaborations 
with developing countries (see 
go.nature.com/yhmmvp). The 
two countries also announced 
three-year partnerships to 
study tuberculosis and 
non-communicable diseases 
in Africa. 


University blaze 

A fire at the University 

of Nottingham, UK, on 

12 September destroyed a 
carbon-neutral chemistry 
laboratory that was under 
construction. Funded in part 
bya £12-million (US$19- 
million) grant from drug giant 
GlaxoSmithKline, the facility 
was slated to open next year. 
The university says that it is 
working with GlaxoSmithKline 
and its contractor to develop a 
plan to rebuild the laboratory. 


Data falsified 

The BMJ Publishing Group 
announced last week the 
retraction ofa June 2013 
article in the British Journal 

of Psychiatry that reported a 
higher incidence of epigenetic 
changes in people with bipolar 
disorder who had experienced 
early-life trauma. An 
investigation by the University 
of Geneva in Switzerland 
found that senior author Alain 
Malafosse had fabricated 

the DNA-methylation data 


Long-lost ship found in Canadian Arctic 


Archaeologists have found one of the ships 
from the Franklin expedition — which 
disappeared in the 1840s — off King William 
Island in the Canadian Arctic. A team led 

by Parks Canada last week discovered either 
HMS Erebus or HMS Terror, Canadian Prime 
Minister Stephen Harper announced on 

9 September. A remotely operated vehicle 
helped to locate the ship’s remains from a sonar 


image (pictured). Since 2008, Parks Canada 
has scoured hundreds of square kilometres of 
ocean floor in search of the ships, which British 
explorer John Franklin commanded while 
seeking the Northwest Passage. Historical 
records suggest that some of the explorers died 
while the ships were trapped in ice, and others 
perished while attempting to walk south. See 
go.nature.com/ugcvuy for more. 


underlying the paper's 
conclusions. A former director 
of genetic psychiatry in the 
university's hospital system, 
Malafosse is also accused of 
embezzling 1.7 million Swiss 
francs (US$1.8 million) in 
government research funds. 


Curiosity arrives 
After more than 2 years and 
9.5 kilometres of driving 
across Mars, NASA’s Curiosity 
rover has reached its ultimate 
goal: a peak called Mount 
Sharp. NASA announced the 
milestone on 11 September 
following the vehicle’ arrival at 
a smoother, less-cratered rock 
formation known to surround 
the mountain. The agency also 
defended the mission against 
recent criticism from a senior 
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review panel, which suggested 
that the project lacks clearly 
defined science goals and 
spends too little time collecting 
and analysing data. 


Bad news for birds 
Many US bird populations 
are declining, largely owing 
to disruption of habitats 

by human development, 
according to The State of the 
Birds 2014, a long-term study 
published on 9 September by 
organizations including the 
Smithsonian Institution in 
Washington DC and the US 
Geological Survey in Reston, 
Virginia. Birds in the arid 
habitats of the western United 
States have been hardest hit, 
with some areas seeing a 46% 
population loss since 1968. 


The report names 230 at-risk 
species, including the Laysan 
albatross (Phoebastria 
immutabilis), a seabird whose 
low-elevation breeding areas 
are threatened by rising sea 
levels. 


Brain-project talks 
The European Union's 
flagship €1-billion 

(US$1.6 billion) Human 
Brain Project has begun a 
‘mediation process’ after 
hundreds of neuroscientists 
fiercely criticized the 
project’s management and 
scientific direction in July 
(see Nature 511, 133-134; 
2014). Germany’s national 
Julich Research Centre said 
on 12 September that its 
board chairman Wolfgang 


PARKS CANADA 


Marquardt will lead the 
mediation, which should be 
completed by mid-2015. 


Stem-cell test 


On 12 September, a woman 
in Japan became the first 
person to receive an 
experimental treatment 
derived from induced 
pluripotent stem cells — 

cells reprogrammed from 
mature tissue to be capable of 
becoming many types of cell. 
See page 287 for more. 


Reef protection 


Australia announced on 

15 September how it intends 
to safeguard the troubled 
Great Barrier Reef, which 
faces threats from climate 
change, coral-eating starfish 
and industrial development. 
The proposed plan outlines 
actions that the national and 
regional governments must 
take over the next 35 years, 
and includes ways to improve 
water quality and biodiversity. 
Conservationists have already 
said that the plan does not 
offer enough protection; the 
proposal is under consultation 
until 27 October. 


Shark cull called off 
Western Australia has halted 
the extension of a controversial 
shark cull, after environmental 
regulators warned ina report 
on 11 September of scientific 


SHEPHERD/ANIMAL AMNESTY 


SOURCE: UNEP/WMO 


Environment Programme 


and the World Meteorological 
Organization. Atmospheric levels 
of ozone-depleting chemicals, 
including chlorofluorocarbons 


— once widely used in 


TREND WATCH 


The ozone layer seems to be on 
the mend, says a 10 September 
report by the United Nations 


refrigerants, and restricted under 


the 1987 Montreal Protocol — 
have declined by 10-15% in the 
past 10-15 years. In parts of the 
atmosphere, ozone levels have 
increased by 5% since 2000. See 


go.nature.com/ozr4np for more. 


uncertainty surrounding 

the programme’ effects on 
the great white shark — a 
protected species. In response 
to seven fatal shark attacks on 
beach-goers between 2010 
and 2013, baited traps called 
drum lines were set up earlier 
this year, catching more than 
172 sharks — most of them 
tiger sharks (pictured). State 
officials had sought to extend 
the programme for another 
three years. 


Swiss diplomacy 


Scientists in Switzerland are 
once more being allowed 

to compete for funds from 
the European Union’s 
Horizon 2020 research 
programme. Switzerland 
was excluded in February 
after it imposed curbs on 
the immigration of citizens 
of the European Union (see 
Nature 506, 277; 2014). The 
European Commission 
announced on 12 September 


OZONE ON THE REBOUND 


that it has negotiated 
temporary exceptions to allow 
applications for some grants 
up to the end of 2016. That 
includes European Research 
Council grants, Marie 
Sktodowska-Curie fellowships 
and specific large projects, 
such as the Human Brain 
Project (headquartered in 
Lausanne) and nuclear-fusion 
programmes. 


Nuclear restart 


Japan has taken its first 

steps towards restarting 
nuclear power generation, 
following the Fukushima 
Dai-ichi plant meltdown in 
2011. On 10 September, the 
Nuclear Regulation Authority 
granted safety approval to 

the Sendai Nuclear Power 
Station, concluding that the 
facility’s two reactors satisfied 
new regulations designed to 
guard against disasters such 
as earthquakes and tsunamis. 
The plant must still clear 
further safety checks and 
obtain local-government 
approval before being turned 
on. Japan’s 48 other reactors 
remain offline. 


EU science leaders 


European Commission 
president-elect Jean-Claude 
Junker announced on 

10 September his nominations 
for the 28 members of the 
next commission. The choices 


Atmospheric levels of chemicals that destroy ozone have 
declined by 10-15% since peaking around 2000. 
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SEVEN DAYS | THIS WEEK | 


21-24 SEPTEMBER 
Two spacecraft are due 
to enter orbit around 
Mars: first NASA's 
MAVEN craft and then, 
three days later, India’s 
Mars Orbiter Mission. 
See page 291 for more. 


23 SEPTEMBER 

A major United Nations 
Climate Summit 

kicks offin New York 
City, intended to 

stoke enthusiasm for 
future international 
negotiations. See 

page 289 for more. 


include former engineer and 
economist Carlos Moedas 
from Portugal as commissioner 
for research, science and 
innovation; and Miguel 

Arias Cafiete, Spain's former 
agriculture and environment 
minister, for climate and 
energy commissioner. The 
nominations must now be 
approved by the European 
Parliament. See go.nature.com/ 
t73tfi for more. 


Stem-cell stock 

On 10 September, the US 
Securities and Exchange 
Commission (SEC) charged a 
prominent stem-cell company 
and a former employee of the 
firm with defrauding investors. 
The SEC says that Gary Rabin, 
former chief executive of 
Advanced Cell Technology 
(ACT) in Marlborough, 
Massachusetts, waited too 
long to notify investors that 

he had sold US$1.5 million 

of the firm's stock between 
2010 and 2012. The company 
is struggling to raise funds to 
support its ongoing research 
(see Nature http://doi.org/q8f; 
2014). ACT and Rabin agreed 
to settle the charges without 
admitting or denying them. 


> NATURE.COM 
For daily news updates see: 
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Masayo Takahashi is the first to implant tissue derived from induced pluripotent stem cells into a person. 


REGENERATIVE MEDICINE 


Japan stem-cell 
trial stirs envy 


Researchers elsewhere can’t wait to test iPS cells in humans. 


BY SARA REARDON & DAVID CYRANOSKI 


I’ve been waiting for this,” says Jeanne 
Loring, a stem-cell biologist at the 
Scripps Research Institute in La Jolla, Califor- 
nia. She is one of several researchers around 
the world to welcome the news that a Japanese 
woman with visual impairment had become 
the first person to receive a therapy derived 
from stem cells known as induced pluripotent 

stem (iPS) cells. 
A lot rides on this trial. If the procedure 


¢C C IB awesome, it’s amazing, I’m thrilled, 


proves safe, it could soften the stance of 
regulatory bodies in other nations towards 
human trials of iPS cells, and it could pave the 
way for treatments for other conditions, such as 
Parkinson's disease and diabetes. It could also 
cement Japan, recently plagued by a stem-cell 
scandal, as a frontrunner in iPS-cell research. 
Pioneered in 2006 by Shinya Yamanaka, now 
director of the Center for iPS Cell Research 
and Applications at Kyoto University, iPS cells 
are created by inserting certain genes into the 
DNA of adult cells to reprogram the cells back 
to an embryonic-like state. The cells can then 


be turned into almost any tissue type, much as 
embryonic stem cells can. But because iPS cells 
can be derived from a patient’s own tissue, the 
hope is that they will dodge some of the con- 
troversial aspects and safety concerns of those 
derived from embryos. 

In 2012, Yamanaka received a Nobel prize for 
his work, and the field has now matured, with 
teams across the world champing at the bit to 
test therapies based on iPS cells in people. Lor- 
ing, for example, uses the cells to create dopa- 
mine-producing neurons as a potential therapy 
for Parkinson's disease, and says that she will 
start clinical trials as soon as the US Food and 
Drug Administration (FDA) gives the go-ahead. 

Still, tissues made from iPS cells carry their 
own concerns, and that had stopped any coun- 
try from approving them for a clinical trial. 
The body’s immune system could attack them, 
or they might contain some cells that are still 
in the pluripotent state and cause cancerous 
growths — although Loring points out that this 
has not happened with human trials of thera- 
pies based on embryonic stem cells, for which 
the same concerns would apply. 


AGREEN LIGHT 

In July 2013, however, Japan’s regulatory 
authorities gave the go-ahead for a team led 
by ophthalmologist Masayo Takahashi at the 
RIKEN Center for Developmental Biology 
(CDB) in Kobe to collect cells to be used ina 
clinical iPS-cell pilot study. 

Her team took skin cells from the first 
patient, a woman in her seventies who had 
retinal damage owing to a condition known 
as age-related macular degeneration. The 
researchers then reprogrammed the skin cells 
into iPS cells and coaxed the unspecialized 
cells into becoming retinal tissue. On 8 Sep- 
tember, Takahashi provided evidence that 
those cells were genetically stable and safe, a 
prerequisite for them to be transplanted into 
the eye. The procedure took place four days 
later, and RIKEN has reported that the patient 
experienced no serious side effects. 

In this instance, the woman’s vision is 
unlikely to improve. However, researchers 
around the world are watching to see whether 
the cells stop the retina from deteriorating 
further and whether any side effects develop. 
Should the woman experience serious conse- 
quences, iPS-cell research could be set back 
years, much as gene therapy was in 1999 when 
a patient died in atrial that attempted to use > 


18 SEPTEMBER 2014 | VOL 513 | NATURE | 287 


© 2014 Macmillan Publishers Limited. All rights reserved 


JUI PRESS/AFP/GETTY 


| NEWS IN FOCUS 


> a modified gene to correct a type of liver 
disease. “That wakes me up at night,” Loring 
admits. 

If Takahashi’s trial succeeds, however, it 
could send a powerful signal to other regulatory 
agencies such as the FDA and the European 
Medicines Agency. “If Masayo can demonstrate 
that these cells are safe in patients, that will have 
calmed some of the anxiety about the new cell 
type out there,” says developmental molecular 
biologist Kapil Bharti at the National Eye Insti- 
tute in Bethesda, Maryland. Bharti is leading 

an effort within the 


“They are sort of US National Insti- 
envious because _ tutes of Health (NIH) 
you can move to develop an iPS-cell 
forwardrapidly therapy for macular 


in Japan.” degeneration using 
an approach similar 
to Takahashi’s. He hopes to apply to the FDA 
in 2017 to begin clinical trials. 

Others are less patient. Stem-cell biologist 
Mahendra Rao, who until recently headed the 
NIH Center for Regenerative Medicine that 
backs Bharti’s trial and is now at the New York 
Stem Cell Foundation, says that regulations 
have been moving too slowly for companies 
outside Japan that want to do similar trials. 
One of these is Q Therapeutics in Salt Lake City, 
Utah, which he founded and which is develop- 
ing cell-based therapies for neurodegenerative 
diseases. “They are sort of envious because you 
can move forward rapidly in Japan,” he says. 

But the Japanese system is also controversial. 
Since approving the Takahashi study, regula- 
tors, keen to stay ahead in stem-cell research, 
have changed the law to make it easier to test 
therapies based on iPS cells clinically, a move 
that some say could result in ineffective treat- 
ments being thrust on desperate patients. 

The surgery offers some welcome positive 
news for RIKEN, and Japan, in the wake of a 
stem-cell scandal and tragedy. “It gives them 
some of the credibility back,’ Loring says. 

Earlier this year, researchers from RIKEN’s 
CDB published two papers in Nature claim- 
ing to have made stem cells through a tech- 
nique known as stimulus-triggered acquisition 
of pluripotency, or STAP. The papers were 
retracted in July, leading to a misconduct 
charge for one researcher, and contributing to 
the suicide of another. The CDB is now being 
halved in size, but in 2016, RIKEN is planning to 
open the ¥3-billion (US$28-million) Kobe Eye 
Center to develop cutting-edge procedures such 
as Takahashi’. And the nation’s new legislation 
means that several other Japanese researchers 
are expected to begin clinical iPS-cell studies 
soon, including Takahashi’s husband, Kyoto 
University’s Jun Takahashi, who is planning a 
trial for Parkinson's disease. 

Outside Japan, many researchers hope that 
Masayo Takahashi’ trial will hasten the trans- 
lation of their own work to therapies. “Every 
time someone goes down this path, it is easier 
for those who are following,’ Loring says. m 


288 | NATURE | VOL 513 | 18 SEPTEMBER 


DESTINATION COMET 


On 11 November, the European Space Agency plans to land a robotic probe, Philae, on the surface of the 
comet 67P/Churyumov-Gerasimenko. Mission scientists meeting over the weekend picked their favourite 
from a shortlist of five sites. The comet is shaped like a rubber duck (seen here from above). 


SITE J Unanimously picked as 
Philae’s target, it is relatively flat and 
boulder-free. It is also well lit and will 
allow for a short descent time, 
saving battery power. 


eo 
: — 


SITE C (not visible) 
Selected as backup, it has 
a view of some of the 
comet's active sites. 


SITE B The ‘heliport’ 
is flat but poorly lit. 


SITE A Overlooks both 
lobes of the comet, but 
has rough terrain. 


SITE! Similar to J, but with 
more slopes, cliffs and hills. 


Lander to aim for 
comet’s ‘head’ 


Touchdown site for Rosetta probe chosen unanimously. 


BY ELIZABETH GIBNEY 


here is no easy way to alight on a 

4-kilometre-long, rubber-duck- 

shaped ice ball that is spinning as it flies 
through the outer Solar System. But scientists 
working on the European Space Agency’s 
Rosetta mission have selected a spot on the 
‘head’ of the comet, called 67P/Churyumov- 
Gerasimenko, that they think will give them 
the best chance for gently landing Philae, a 
washing-machine-sized robotic probe. 

Planned for 11 November, the first soft 
landing ever attempted on a comet is fraught 
with risk. When researchers still thought 
that the object had a regular, potato-like 
shape, they estimated the landing’s chance 
of success at 70-75%. Now that the Rosetta 
orbiter has taken a closer look and revealed 
the curious shape, the odds are lower. Mark 
McCaughrean, a senior science adviser at 
the European Space Agency (ESA) directo- 
rate of science and robotic exploration in 
Noordwijk, the Netherlands, puts them at 
roughly “fifty—fifty”. 

The mission scientists were unanimous in 
their choice of landing spot — a 1-square- 
kilometre patch known as site J] — froma 
shortlist of five (see “Destination comet’). 
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Philae lead scientist Jean-Pierre Bibring of 
the University of Paris-South in Orsay says 
that site J emerged as the favourite after the 
first day of a meeting held on the weekend 
of 13-14 September at the French National 
Centre for Space Studies in Toulouse. “This 
site is not the best for every one of the tech- 
nical and scientific criteria, but overall it’s 
by far the best for mission success,’ he says. 

In a precisely choreographed fly-by, 
Rosetta will release Philae from a dis- 
tance of about 10 kilometres. From there, 
the probe will drift unguided towards 
the target, where it will secure itself with 
harpoons and screws and start work. The 
information that Philae collects about the 
comet’s innards will help to calibrate data 
gathered by the more powerful instruments 
on Rosetta, says McCaughrean. “There are 
many things that we can only do on the sur- 
face, he says. 

A major advantage of site J is that the drop 
from Rosetta will be relatively short, at just 
7 hours. That means that Philae will have 
more battery power to run its instruments 
after landing, because it will take two days 
to recharge using its solar panels. 

The region also has relatively few boulders 
that could capsize Philae on landing. Still, 


ESA/ROSETTA/MPS FOR OSIRIS TEAM MPS/UPD/LAM/IAA/SSO/INTA/UPM/DASP/IDA 


nowhere is devoid of danger. “There is no one 
big Heathrow airport on the surface where 
you can say, ‘No problem,” says McCaughrean. 

Although chosen mainly for technical con- 
siderations, the site is also interesting scien- 
tifically. It is just a few hundred metres from 
two pits that scientists think will become more 
active, spewing out gas and dust, as the comet 
moves closer to the Sun and heats up. The 
position of the landing relative to Rosetta’s 
orbit will also afford the best chance of trans- 
mitting radio waves between the two craft to 


map the comet's interior, says Bibring. 

The mission team says that it reached its 
decision quickly, then spent most of the meet- 
ing’s second day picking a backup — a spot on 
the comet’s body known as site C. 

Other potential backups included a crater 
nicknamed ‘the heliport for its flatness, but 
the site is not as well lit as site C. And a spot 
that would have provided views of the body, 
head and highly active ‘neck’ region had been 
effectively ruled out before the meeting even 
started, says Bibring, because Rosetta would 
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have needed to drop to an orbit that was 
dangerously close to the comet. 

The Rosetta team is rushing to gather as much 
data as possible and to stick to the November 
landing date, because after that increased comet 
activity could damage the orbiter. 

Rosetta has been chasing its quarry for a dec- 
ade. After waking from hibernation in January, 
it arrived at its destination in August and has 
been charting its target from ever-shrinking 
orbits ever since. Rosetta will continue to fol- 
low the comet as it journeys around the Sun. = 


Climate summit previews 
push for new global treaty 


United Nations meeting aims to spark enthusiasm for a 2015 emissions pact. 


BY LAUREN MORELLO 


hen the United Nations Climate 
Summit begins on 23 September 
in New York City, US President 


Barack Obama will be there. But many of his 
counterparts from other major greenhouse- 
gas-emitting countries — including China, 
India, Germany and Australia — plan to stay 
at home. Still, the meeting could offer impor- 
tant clues to how a UN push to forge a new 
international climate pact by the end of 2015 
will play out. 

Approved in 1992 at the Earth Summit in 
Rio de Janeiro, the Kyoto Protocol is the only 
legally binding international treaty governing 
greenhouse-gas emissions. Parts of it expired 
at the end of 2012, and efforts have since been 
afoot to develop a new pact, as demanded by 
the 22-year-old United Nations Framework 
Convention on Climate Change (UNFCCC). 

The New York meeting is not part of the 
formal process to shape a new international 
climate treaty. Instead, the gathering was con- 
ceived by UN secretary general Ban Ki-moon as 
away of marshalling enthusiasm for the effort, 
which is set to conclude in Paris in December 
2015; any agreement would take effect in 2020. 

International climate negotiations have a 
long and chequered history. The Kyoto Protocol 
was never ratified by the United States, did not 
require rapidly developing countries to reduce 
their emissions and was rejected by Canada 
and Russia as the first round of emissions- 
reductions commitments expired in 2012. 

In the meantime, the world’s output of 
carbon dioxide and other heat-trapping gases 
has continued to rise. The level of CO, in the 
atmosphere reached 396 parts per million in 


2013, 42% higher than pre-industrial levels. 
Last year’s was the largest annual increase 
since 1984, according to figures reported on 
9 September by the World Meteorological 
Organization in Geneva, Switzerland. 

Climate negotiators face sticky questions as 
they work to craft a new agreement. Some are 
basic: will the Paris treaty be legally binding? 
Others are more complex and long-standing. 
Many developing nations, for instance, worry 
that carbon cuts will jeopardize their economic 
progress. The challenge is formidable, says 
Nicholas Stern, a climate-change economist 
at the London School of Economics. By 2030, 
Stern says, the world must reduce its green- 
house-gas emissions by roughly 20% from the 
current level to have a chance of limiting warm- 
ing to 2°C above pre-industrial temperatures, 
the UNFCCC’ stated goal. Current emissions 
pledges put the world on track for a 3 °C warm- 
ing by 2100, according to a7 September report 
by PriceWaterhouseCoopers. 

Yet climate-policy experts insist that there 
is reason for optimism heading into the New 
York meeting. “We need to stop judging climate 
action by whether we get a so-called legally 
binding treaty,’ says Paul Bledsoe, senior fellow 
at the German Marshall Fund in Washington 
DC anda White House climate-change official 
under former president Bill Clinton. 

Bledsoe sees signs of meaningful progress by 
major emitters such as China. The country has 
enacted a cap-and-trade programme to reduce 
greenhouse-gas emissions in seven provinces 
and increased its investment in renewable 
energy. The world’s leading greenhouse-gas 
emitter is also considering a ban on coal-fired 
electricity generation in the Beijing area to 
address air-quality concerns. The move would 


be an important step away from the form of 
power generation that produces the most 
greenhouse-gas emissions. 

There are also signs of progress in the United 
States, the second-largest emitter of greenhouse 
gases, says David Waskow, director of the inter- 
national climate project at the World Resources 
Institute in Washington DC. Obama is using his 
executive power to enact policies that reduce 
greenhouse gases — bypassing Congress, 
which has long stymied any plan for emissions 
reductions. Waskow points to Obama's pro- 
posal, unveiled in June, to cut greenhouse-gas 
emissions from existing power plants, which 
produce 38% of the country’s total. The United 

States already regu- 


Last year ’swas lates emissions from 

thelargestannual automobiles and is 

CO, increase working to finalize 

since 1984. limits for new power 
plants. 


Although India’s prime minister, Narendra 
Modi, will not be attending, the stance of India’s 
representatives to the New York summit could 
yield clues to the country’s climate policy. Modi 
took office in May and championed renewable 
energy during more than a decade as chief 
minister of Gujarat state. “It’s a new govern- 
ment, and they haven't had a lot of time to sink 
their teeth into this,” says Jake Schmidt, direc- 
tor of the international climate programme 
at the Natural Resources Defense Council in 
Washington DC. “India is a big unknown.” 

But the true test of progress towards a new 
treaty will not come until March, when nations 
are required to submit their national green- 
house-gas reductions goals to the UNFCCC. 
“That is really the timeline to keep an eye on,” 
says Waskow. = 
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Soft drinks are just some of the many products that use artificial sweeteners. 


NUTRITION 


Sugar substitutes 
linked to obesity 


Artificial sweetener seems to change gut microbiome. 


BY ALISON ABBOTT 


r | Nhe artificial sweeteners that are widely 
seen as a way to combat obesity and dia- 
betes could, in part, be contributing to 

the global epidemic of these conditions. 

Sugar substitutes such as saccharin might 
aggravate these metabolic disorders by act- 
ing on bacteria in the human gut, accord- 
ing to a study published by Nature this week 

(J. Suez et al. Nature http://dx.doi.org/10.1038/ 

nature13793; 2014). Smaller studies have pre- 

viously purported to show an association 
between the use of artificial sweeteners and the 
occurrence of metabolic disorders. This is the 
first work to suggest that sweeteners might be 
exacerbating metabolic disease, and that this 
might happen through the gut microbiome, the 
diverse community of bacteria in the human 
intestines. “It’s counter-intuitive — no one 


2 
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expected it because it never occurred to them 
to look,” says Martin Blaser, a microbiologist at 
New York University. 

The findings could cause a headache for the 
food industry. According to BCC Research, a 
market-research company in Wellesley, Mas- 
sachusetts, the market for artificial sweeteners 
is booming. And regulatory agencies, which 
track the safety of food additives, including 
artificial sweeteners, have not flagged such a 
link to metabolic disorders. In response to the 
latest findings, Stephen Pagani, a spokesman 
for the European Food Safety Authority (EFSA) 
in Parma, Italy, says that, as with all new data, 
the agency “will decide in due course whether 
they should be brought to the attention of panel 
experts for review”. 

A team led by Eran Elinav of the Weizmann 
Institute of Science in Rehovot, Israel, fed mice 
various sweeteners — saccharin, sucralose and 


| MORE NEWS | 
‘Vegetative’ @ Q&A: UK medical charities chief 
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aspartame — and found that after 11 weeks, the 
animals displayed glucose intolerance, a marker 
of propensity for metabolic disorders. 

To simulate the real-world situation of peo- 
ple with varying risks of these diseases, the 
team fed some mice a normal diet, and somea 
high-fat diet, and spiked their water either with 
glucose alone, or with glucose and one of the 
sweeteners, saccharin. The mice fed saccharin 
developed a marked glucose intolerance com- 
pared to those fed only glucose. But when the 
animals were given antibiotics to kill their gut 
bacteria, glucose intolerance was prevented. 
And when the researchers transplanted faeces 
from the glucose-intolerant saccharin-fed mice 
into the guts of mice bred to have sterile intes- 
tines, those mice also became glucose intoler- 
ant, indicating that saccharin was causing the 
microbiome to become unhealthy. 

Elinav’s team also used data from an ongoing 
clinical nutrition study that has recruited nearly 
400 people in Israel. The researchers noted 
a correlation between clinical signs of meta- 
bolic disorder — such as increasing weight or 
decreasing efficiency of glucose metabolism — 
and consumption of artificial sweeteners. 

But “this is a bit chicken-and-egg’ says Elinav. 
“If you are putting on weight, you are more 
likely to turn to diet food. It doesn’t necessarily 
mean the diet food caused you to put on weight” 

So his team recruited seven lean and healthy 
volunteers, who did not normally use artificial 
sweeteners, for a small prospective study. The 
recruits consumed the maximum acceptable 
daily dose of artificial sweeteners for a week. 
Four became glucose intolerant, and their gut 
microbiomes shifted towards a balance already 
known to be associated with susceptibility to 
metabolic diseases, but the other three seemed 
to be resistant to saccharin’s effects. “This under- 
lines the importance of personalized nutrition 
— not everyone is the same,’ says Elinav. 

He does not yet propose a mechanism for 
the effect of artificial sweeteners on the micro- 
biome. But, says Blaser, understanding how 
these compounds work on some species in the 
gut might “inspire us in developing new thera- 
peutic approaches to metabolic disease”. 

Yolanda Sanz, a nutritionist and vice-chair 
of the EFSA’s panel on dietetic products, nutri- 
tion and allergies, says that it is too soon to draw 
firm conclusions. Metabolic disorders have 
many causes, she points out, and the study is 
very small. m 
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Indian Mars craft 
prepares for orbit 


Mangalyaan aims to be Asia’s first successful Martian mission. 


BY SANJAY KUMAR 


ignesh Nair wanted to know the speed 

\ | of the spacecraft; Mayyan Baatish 
asked why India was going to Mars 

at all, given the cost. For once, the 30-odd 
members of the Astronomy Club at the 
G. D. Goenka Public School in a Delhi sub- 
urb were discussing something home-grown: 
India’s Mars spacecraft Mangalyaan, which is 
due to start orbiting the red planet on 24 Sep- 
tember. If all goes according to plan, it will be 
the first successful Mars mission launched by 
an Asian nation — anda point of Indian pride. 

Mangalyaan, known formally as the Mars 
Orbiter Mission, or MOM, was launched 
by the Indian Space Research Organisation 
(ISRO) last November. With 5 scientific instru- 
ments that collectively weigh just 15 kilograms, 
it is designed to image the planet and probe the 
composition of the surface and atmosphere, 
including testing for methane and measuring 
the ratio of deuterium to hydrogen. 

Those are modest goals compared with, say, 
the much larger NASA orbiter MAVEN (Mars 
Atmosphere and Volatile Evolution), which is 
also en route to the red planet. Scheduled to 
arrive just three days ahead of MOM, it has eight 
instruments and would be the first spacecraft to 
examine questions such as how the solar wind 
has stripped away the Martian atmosphere. 

As a result, the anticipation surrounding 
MOM comes not from the science, but from 
what a safe arrival would mean for India. “It 
will be a validation that Indian research and 
development has come of age,” says Amitabha 
Ghosh, an Indian-born planetary scientist 
based in Washington DC. “India is still per- 
ceived as a place where work is outsourced, not 
because of superior science and engineering 
skills but because of a cost advantage.” 

ISRO has launched 35 satellites for countries 
including France, Germany, Canada, Israel and 
Singapore. Success for MOM could boost that 
commercial space industry, says Ajey Lele, a 
research fellow at the Institute for Defence Stud- 
ies and Analysis, a think tank in New Delhi. 

But trepidation still dogs the mission. MOM 
is about to enter a critical period: it has been 
in sleep mode for several months and must 
soon restart and then slow itself down, by fir- 
ing its rockets for about 24 minutes, before 
it can enter Martian orbit. There is only one 


The Mars Orbiter Mission will hunt for methane. 


opportunity for insertion, says A. S. Kiran 
Kumar, who runs ISRO’s Space Applications 
Centre in Ahmedabad. “We are verifying 
everything daily and watching closely for any 
disturbance,’ he adds. 

Ghosh worries that MOM’s development was 
rushed, having taken just 15 months, according 
to ISRO. “A significant gestation period would 
have ensured proper engineering rigour and 
maximized the chances of success,’ he says. 

Scientists also question the choice of rocket 
for the mission. MOM was launched using 
ISRO’s low-power workhorse rocket, the Polar 
Satellite Launch Vehicle (PSLV), designed for 
putting satellites into low Earth orbit. That lim- 
ited the weight of the payload — and, some 
suggest, MOM’s scientific potential, although 
ISRO says that it miniaturized components to 
compensate. ISRO has been working for more 
than a decade and a half on the more power- 
ful Geosynchronous Launch Vehicle, but the 
rocket has suffered a string of failures and 
did not manage to conduct a fully successful 
launch until January this year. 

Whatever happens next, says Lele, India 
has learnt a lot from MOM. Using the light- 
weight PSLV at launch, for example, meant 
that the craft had to take a circuitous route to 
Mars, because it could not achieve the velocity 
needed to travel to the planet directly. “Being 
able to launch a robust spacecraft, overcoming 
minor glitches, sending it towards Mars inno- 
vatively, braving adverse space weather and 
radiation hazards while maintaining reliable 
communication for nearly ten months — it is 
already a significant achievement for India’s 
first deep-space endeavour, says Lele. m 


Additional reporting by Alexandra Witze. 
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Loss of crops to drought contributed to a food crisis in Ethiopia in 2008. 
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Cross-bred crops 
get fit faster 


Genetic engineering lags behind conventional breeding in 
efforts to create drought-resistant maize. 


BY NATASHA GILBERT 


ld-fashioned breeding techniques 
() seem to be leading genetic modifica- 

tion in a race to develop crops that can 
withstand drought and poor soils. 

As the climate warms and rainfall becomes 
more erratic, farmers worldwide will increas- 
ingly need crops that can thrive in drought 
conditions. And the high costs of fertilizers 
— along with the environmental damage they 
can cause — are also pushing farmers to look 
for crop varieties that can do more with less. 

The need for tougher crops is especially 
acute in Africa, where drought can reduce 
maize (corn) yields by up to 25%. The Drought 
Tolerant Maize for Africa project, which 
launched in 2006 with US$33 million, has 
developed 153 new varieties to improve yields 
in 13 countries. In field trials, these varieties 
match or exceed the yields from commercial 
seeds under good rainfall conditions, and yield 
up to 30% more under drought conditions. 

An analysis published earlier this year 
reported that by the project’s end in 2016, 
the extra yields from drought-tolerant maize 
could help to reduce the number of people liv- 
ing in poverty in the 13 countries by up to 9% 


(R. La Rovere et al. J. Dev. Areas 48, 199-225; 
2014). In Zimbabwe alone, that effect would 
reach more than halfa million people. 

The project’s success is due in large part to 
its access to a large seed bank managed by one 
of its partners, the International Maize and 
Wheat Improvement Center (CIMMYT) in 
Mexico City. Breeders from CIMMYT and the 
International Institute for Tropical Agriculture 
in Ibadan, Nigeria, searched the collection 
for maize varieties that thrive in water-scarce 
regions. The researchers cross-bred these 
varieties and then mated the most drought- 
tolerant of their offspring. Several cycles of this 
process led to seed that was better adapted to 
water-scarce conditions. In a final step, project 
scientists cross-bred these plants with varieties 
that have been successful in Africa. 

“Tt is a painstaking and expensive process,” 
says Kevin Pixley, director of CIMMYT’s 
genetic resources programme. 

The CIMMYT researchers established that 
certain characteristics predict how a maize 
plant will fare in drought. One of the most tell- 
ing is the number of days between when the 
plant’s male organs shed pollen and when the 
female silks emerge. When water is scarce, the 
silks emerge late. If the delay is long enough, 
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they emerge after the plants have released their 
pollen and are not fertilized. 

“Finding out this relationship was very 
important to be able to select for drought tol- 
erance,’ says Pixley. By favouring plants with 
shorter intervals between pollen release and 
silk emergence, breeders were able to produce 
maize that was more resistant to drought. 

Drought tolerance is a complex trait that 
involves multiple genes. Transgenic tech- 
niques, which target one gene at a time, 
have not been as quick to manipulate it. But 
CIMMYT and six other research organizations 
are also developing genetically modified (GM) 
varieties of drought-resistant maize, in collab- 
oration with agricultural biotechnology giant 
Monsanto in St Louis, Missouri. Coordinated 
by the African Agricultural Technology Foun- 
dation in Nairobi, the Water Efficient Maize for 
Africa project aims to have a transgenic variety 
ready for African farmers by 2016 at the earliest. 

Like drought resistance, maize’s ability to 
grow in nitrogen-poor soils is genetically com- 
plex, and the need for varieties that do well with 
little fertilizer is pressing. Most African farm- 
ers can afford only one-tenth the amount of 
fertilizer recommended for their crops. This 
is one of the biggest problems they face, says 
Biswanath Das, a maize breeder at CIMMYT. 

Researchers at CIMMYT are working to 
address that problem through the Improved 
Maize for African Soils (IMAS) project, a 
collaboration with the Kenya Agricultural 
Research Institute in Nairobi; the South Afri- 
can Agricultural Research Council in Pretoria; 
and DuPont Pioneer in Johnston, Iowa. The 
10-year, US$19.5-million project is pursuing 
conventional and transgenic approaches. 

Since its launch in 2010, IMAS has developed 
21 conventionally bred varieties. Over the next 
year the project’s leaders hope to commercial- 
ize these varieties and introduce them in eight 
countries. In field tests, IMAS varieties yielded 
up to 1 tonne per hectare more in nitrogen-poor 
soils than did commercially available varieties. 
By contrast, the project's researchers say that 
they are at least 10 years from developing a 
comparable GM variety. 

Conventional breeding will probably have a 
greater impact, says Das, “but it is important 
to consider all options”. = 


CORRECTIONS 

The News Feature ‘Survival of the fittest’ 
(Nature 513, 157-159; 2014) referred to 
the wrong Possession Island. The penguin 
work was done on the French Base d’Alfred 
Faure in the Crozet archipelago. 

The World View by Casparus J. Crous 
(Nature 513, 7; 2014) implied that Saudi 
Arabian scientists on highlycited.com were 
all at a single university. In fact, most were at 
one institution but several came from three 
other universities in Saudi Arabia. 
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Destructive lionfish are invading coral reefs in the Americas, but fishing 
competitions can help to keep the problem species in check. 


BY HANNAH HOAG 


tephanie Green plunged her hands, sheathed in thick 
black gloves, into a cooler full of lionfish. Skilfully avoid- 
ing its 18 venomous spines, she plucked one out and laid 
it on a table to record its length. Nearby, volunteers were 
chopping up the brown, red and white striped fish to 
make ceviche and passing the dish into the crowd. 

As they nibbled on the food, teams of scuba divers 
milled around the scoring area. They checked out each other’s catches 
and argued over who would be taking home the more than US$3,500 
worth of prizes from the 2013 lionfish hunting derby in Key Largo, 
Florida. 

“At the check-in time it’s a mad rush, with teams coming in with 
coolers of fish, trying to beat the clock,” says Green, the chief scien- 
tist of the contest and a marine ecologist at Oregon State University in 
Corvallis. By the end of that day last September, Green and the other 
scorekeepers had counted 707 lionfish, from one smaller than a golf ball 
to one that stretched nearly two soccer balls long. 

The hunting competition is part of an effort to tackle an invasive 
species that has been identified as one of the world’s greatest conser- 
vation issues'. Since lionfish (Pterois volitans) first appeared on the 
eastern seaboard of the United States in the 1980s, the voracious preda- 
tors have gobbled up coral-reef fish from North Carolina to Venezuela. 
Officials responsible for protecting reefs have struggled to find ways to 


294 | NATURE | VOL 513 | 18 SEPTEMBER 2014 


control populations, and managers are embracing these fishing contests 
ina handful of coastal communities. 

The strategy is a bit ofa gamble, given that competitions to catch other 
invasive species — such as pythons in Florida — have had limited suc- 
cess. But the data collected by Green show that even one-day contests can 
effectively knock down local populations. Her findings and those from 
other hunting efforts offer lessons on how a little bit of reward money — 
coupled with science and outreach — can help to keep invasive species in 
check. “We can’t control lionfish in the entire ocean, but derbies can have 
high impacts locally,’ says James Morris, an ecologist with the US National 
Oceanic and Atmospheric Administration in Beaufort, North Carolina. 

Like many invasions, the lionfish conquest started small. The fish 
are normally found in the western Pacific Ocean, Indian Ocean and 
Red Sea, where predators and competitors keep the populations under 
control. Genetic analysis” suggests that roughly a dozen fish were first 
introduced off the Florida coast, either accidentally or intentionally 
released from aquariums. From there, the population exploded. Lion- 
fish spawn almost continuously, releasing 2 million eggs a year, and they 
have few predators or competitors in their new home. 

“At first people thought they were funny, beautiful,” says Mark Vermeij, 
a conservation biologist at the Caribbean Marine Biological Institute on 
the island of Curacao. But opinions changed as the lionfish took over, he 
says. “Quite quickly they were everywhere. They were like cockroaches.” 
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Since they were first spotted near Fort Lauderdale, Florida, in 1985, 
lionfish have colonized more than 4 million square kilometres — 
throughout the Caribbean Sea, the Gulf of Mexico and all along the 
Atlantic coastline of the southern United States — and show no signs of 
relenting. Marine ecologists worry that the invasion will eventually extend 
to Uruguay, stopped only by winter water temperatures. It could become 
one of the most ecologically harmful fish introduc- 
tions in the western Atlantic, says Mark Hixon, a 
marine ecologist now at the University of Hawaii 
at Manoa, and Green’ supervisor at Oregon State. 
At some sites off the coast of North Carolina and 
in the Bahamas, the populations are 5-15 times 
denser than in the fish’s natural range, sometimes 
even reaching 400 fish per hectare. 

The conquest could have profound effects on 
the biodiversity of coral-reef ecosystems. Lionfish 
consume whatever fits in their maws — and a lot 
of it. A DNA analysis’ of the stomach contents of 157 lionfish caught 
in the Mexican Caribbean identified 43 crustacean and 34 fish species, 
including parrotfish, French grunt and graysby — important sources of 
food for local people. Without natural predators, a lionfish can gobble 
up 79% of the juvenile fish on a reef in as little as five weeks. 

The feeding frenzy could also lead to larger problems. Some of the fish 
they prey on clean algae off coral reefs and are already overfished in the 
Caribbean. Without these essential species, algae could outcompete the 
corals. Simulations by Jestis Ernesto Arias-Gonzalez at the Center for 
Research and Advanced Studies of the National Polytechnic Institute, 
in Mérida, Mexico, have shown that a lionfish invasion would decrease 
the biomass of corals in a Caribbean reef by about 10% within ten years’. 


OUT OF CONTROL 

Green did not set out to study lionfish. She had just started a doctorate 
in conservation biology when she travelled to the Bahamas in 2008 
with her adviser Isabelle Coté, a biologist at Simon Fraser University 
in Burnaby, Canada. A student they visited was seeing lionfish all over 
her study sites. “Nobody knew anything about them, the basics of where 
they were, or what they ate,” says Green. 

Green and Cété wondered whether the native fish would return if 
they removed the lionfish. In December 2009, they staked out 24 patches 
of reef and arranged for scuba divers to prune the population of lionfish 
at the sites every month for 18 months. They predicted that the culling 
effort would need to remove 25-92% of the predators, depending on 
the site, to keep them from consuming too much of the prey species. By 
the end of the experiment, the native fish had rebounded by 50-70% in 
the reefs that reached the targeted level of protection’. 

Green and Cété were not the only ones to hunt down lionfish. Earlier 
that year, the Reef Environmental Education Foundation (REEF) in Key 
Largo, Florida, had started running derbies in the Bahamas to increase 
local awareness of the invasion. Green, who had been collaborating with 
the foundation during her PhD, got involved in planning the first hunts. 

She later decided to use the competitions to test whether limited hunts 
could have an impact. With the help of volunteers outfitted in scuba 
and snorkels, Green counted lionfish at 60 sites before and after the 
derbies in Key Largo and the Bahamas in 2012 and 2013. On the basis 
ofa preliminary analysis of the derbies, she says, “there were dramatic 
drops in the densities of lionfish in the sites where people fish?” After 
the competitions, lionfish densities were slashed by more than 60% over 
a 100-150 km” area compared with pre-derby levels. “It’s like pulling 
weeds from your garden,” she says. “You're not going to completely get 
rid of them, but below a certain level, they won't cause problems.’ 

Lionfish recolonized the sites within six months, but the animals 
were significantly smaller, which helped to reduce pressure on the reef. 
Smaller lionfish eat less, prey on smaller fish and produce fewer young. 

Ted Grosholz, a marine ecologist at the University of California, 
Davis, says that the data collected by Green and REEF support the idea 
that derbies can effectively control lionfish populations in selected 


“QUITE QUICKLY THEY 
WERE EVERYWHERE. 
THEY WERE LIKE 
COCKROACHES.’ 
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areas. They also dovetail with results from other lionfish control 
efforts. When the fish invaded the Dutch Caribbean in 2009, volun- 
teers immediately began to use spear guns to remove lionfish from 
the island of Bonaire, but did nothing in neighbouring Curagao. After 
two years of spearfishing, Vermeij and his colleagues found that the 
lionfish biomass in the treated areas of Bonaire was just one-third of 
that in the unfished areas, and about one-quarter 
of what was seen in Curagao®. 


ON TARGET 

The lionfish contests have been much more 
successful than some other efforts that have used 
hunters to control invasive species. In 2013, for 
example, the Florida Fish and Wildlife Conser- 
vation Commission organized the first Python 
Challenge, a month-long event with cash prizes 
that enlisted professional and amateur hunters to 
remove Burmese pythons (Python bivittatus). But the pythons proved 
tough to catch because they are hard to spot in the Florida brush; the 
hunters caught just 68 snakes from a population that is estimated at 
30,000-100,000. 

Jason Goldberg, a biologist at the US Fish and Wildlife Service in 
Arlington, Virginia, says that derbies could be improved by incorpo- 
rating the results of research. Organizers need to calculate how many 
individuals to remove, whether it is better to cull older or larger indi- 
viduals and how their density affects the health of the population. That 
information can then be used to set hunting targets — and prevent the 
kinds of problems that arose when Australia culled red foxes (Vulpes 
vulpes). The 2002-03 Victorian Fox Bounty Trial removed one-fifth of 
the state’s red foxes but ended up boosting the population because the 
survivors thrived when they had less competition for food’. 

Cash incentives can help by drawing amateurs into efforts to control 
invasives. In the Pacific northwest, for example, anglers are offered $4-8 
for every northern pike minnow (Ptychocheilus oregonensis) they cap- 
ture to deter the fish from preying on young salmon. The programme 
has removed more than 3.9 million fish and slashed predation by 40%. 

Goldberg says that research on lionfish derbies should offer insight 
into how often — and when — they should take place at each location. 
He adds that new steps might be needed, such as encouraging commer- 
cial fishing of lionfish to make the species more common in restaurants. 

The lionfish invasion and the success of the derbies has led to pol- 
icy changes in Florida. In August, wildlife regulators relaxed hunting 
restrictions in the state to allow divers wearing rebreathers — devices 
that allow them to remain in the water for longer — to harvest lionfish. 
It will also now allow derby participants to spear lionfish in areas where 
spearfishing is otherwise prohibited. “For marine protected areas to 
function as conservation areas, it’s important that the biology and ecol- 
ogy be conserved to the highest level possible, and that now requires 
lionfish control,” says Morris. 

With her results pointing in a positive direction, Green intends to 
continue analysing data from lionfish derbies, including an event in Key 
Largo on 13 September. When she shares her research findings with the 
divers, it tends to fire them up, she says. “There's this good community 
feeling at the derbies that this is a tool that can have a positive effect and 
help to suppress the invasion.” = 


Hannah Hoag is a freelance writer in Toronto, Canada. 
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RIDE IN SCIENCE 


The sciences can be a sanctuary for gay, lesbian, bisexual 
and transgender individuals, but biases may 
still discourage many from coming out. 


BY M. MITCHELL WALDROP 


was the golden child,’ says Justin Trotter, 
thinking back to his teenage years liv- 
ing near the Kennedy Space Center in 
_ Brevard County, Florida. The handsome, 
articulate son of a devout Mormon family, he 
earned top grades, assembled winning projects 
for science fairs and worked in university labo- 
ratories from the age of 14. 
But he was also wrestling with a secret. Trot- 
ter, now a neuroscience postdoc at Stanford 
University in California, says that as early as 


the ages of 11 or 12 he had begun to sense that 
he was attracted to boys — a feeling that he 
had always been taught was shameful. So all 
through his teens and early twenties, he says, 
he struggled to keep his sexuality hidden, to 
appear masculine, to blend in. 


DIVERSITY 


ANature and Scientific American 
special issue nature.com/diversity 
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“T dreaded dealing with it,” says Trotter. By 
his undergraduate years at university he was 
suffering from exhaustion, depression and 
panic attacks. “My only escape was to work 
in the lab,” he says: “It was my haven.” But 
the stress took its toll even there. “I felt my 
memory wasn't good. I wasn't as sharp as I 
could be.” 

It was not until the last two years of his grad- 
uate studies, at the University of South Florida 
in Tampa, that Trotter finally came out, con- 
fiding to a few close friends that he was gay. 
As the word spread, he found his depression 
lifting. His energy improved. His work became 
more focused. 

“When I felt I could just be who Iam, a full 
person,’ says Trotter, “then it was definitely 
good for the science.” 

That message is being heard in more and 
more laboratories and research centres around 
the world. People who identify as lesbian, gay, 
bisexual or transgender (LGBT) have long 
faced discrimination or worse: they are still 
considered outcasts or even outlaws in most 
Muslim nations, as well as in Russia and parts 
of Asia. But attitudes are changing. Accord- 
ing to a survey published last year by the Pew 
Research Global Attitudes Project, openly 
gay individuals have high levels of public 
acceptance across broad swathes of Western 
Europe, Australia, Canada and Latin Amer- 
ica (see ‘Degrees of acceptance’). Nowhere is 
this change more visible than in the United 
States, home of the world’s largest research 
enterprise, where public attitudes are shifting 
towards acceptance of LGBT people faster 
than in almost any other nation. Courts and 
legislatures are lifting restrictions on same-sex 
marriage in state after state, often in the face 
of vehement opposition from social conserv- 
atives, and LGBT equality has emerged as a 
dominant civil-rights issue. 

“This is an important time in history for the 
LGBT community,” says Trotter — not unlike 
the period several decades ago when women 
and under-represented ethnic minorities 
began their push for greater recognition in 
science. Just as those groups once did, LGBT 
researchers are trying to seize the moment by 
creating an infrastructure of organizations 
and interest groups geared towards helping 
one another with information, support and 
networking (see Nature 505, 249-251; 2014). 


OUT IN THE OPEN 
In this newly open environment, LGBT 
scientists are finding it easier to declare them- 
selves — or at least, to think about doing so. 
“Tm getting a constant stream of e-mails from 
young scientists: ‘Can I meet with you?’,” says 
Ben Barres, a Stanford neuroscientist who 
transitioned from female to male in 1997, and 
who has become a prominent spokesman for 
LGBT issues in science. 

But just as for ethnic minorities and women, 
there is still a long way to go. Many LGBT 


scientists fear coming out — if only because 
publications, career progression and promo- 
tion are based heavily on the judgement of 
fellow scientists, which might be influenced 
by conscious or unconscious bias. And many 
students may be avoiding a research career 
entirely — although no one knows, because 
no one has counted. 

“IT worry that there is a vast pool of talent 
that might be being lost to science,” says Trot- 
ter. The only way to change that, he says, is 
for the scientific community to reach out 
to its LGBT members, and have an honest 
conversation. 

The lab can be an excellent place for that 
dialogue, says Kale Edmiston, a neuroscience 
graduate student at Vanderbilt University in 
Nashville, Tennessee. “The cool thing about 
scientists is that we try to withhold judge- 
ment and gather information,” he says. That 
is exactly what happened when Edmiston 
began transitioning in 2010. He told every- 
one in his research group that he would be 
taking hormones and that his appearance 
would change. They responded with a com- 
bination of sympathy, interest and curios- 
ity. “A lot of my peers and colleagues have 
really listened and heard what I’m saying,’ 
says Edmiston. Rachael Padman had much 
the same experience in the early 1980s, when 
she transitioned from male to female while a 
graduate student in astrophysics at the Uni- 
versity of Cambridge, UK. “One colleague 
was never able to say ‘she’ instead of ‘he’,” 
says Padman. “But that was just one in the 
last 35 years. From almost everyone else, I 
just never get any vibes at all” 

The mores of research can work the other 
way, too, says Vivian Underhill, a field 


“WHEN T FELT 1 COULD JUST 
BE WHO TAM, A FULL PERSON, 
THEN TT WAS DEFINITELY GOOD 

FOR THE SCIENCE.” 


hydrologist with the University of Colorado 
Boulder’s Institute for Arctic and Alpine 
Research, and author of ‘Queered Science: a 
blog series about LGBT researchers. “As scien- 
tists we like to think that we're objective,’ she 
says — that personal and social issues should 
be kept separate from the real work. “And 
mostly that’s a good thing,” she says, but too 
often it leads people to assume that they can 
eliminate biases by not talking about them. 
“That just enables the fear to propagate.” 

It can also make it hard to comprehend the 
stark loneliness that comes with being LGBT 
in a majority-straight world. Unlike women 
or ethnic minorities, LGBT people are not 
automatically born into a peer group, says 
Darrin Wilstead, director of operations for 
the Point Foundation, an LGBT scholarship 
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and mentoring fund based in Los Angeles, 
California. Almost always, he says, “they come 
home to a family that does not share their iden- 
tity” — and may not understand or accept it. 
Everyone has to come to terms with their sexu- 
ality as they grow up. But LGBT individuals 
often have to begin that journey in isolation. 

Gay, lesbian and bisexual feelings typi- 
cally emerge around the time of puberty 
— although they may start much earlier. 
“When you're a kid, you just aren't as aware 
of who you're attracted to,” says Eli Capello, 
an undergraduate neuroscience major at 
Centenary College of Louisiana in Shreve- 
port. Transgender issues, by contrast, can 
become obvious at a very early age. “I knew 
something was up when I was 3 and 4,” says 
Capello, who transitioned when he was 18. “I 
just didn’t know what?” 


GROWING PAINS 

Many people lack basic knowledge about 
gender identity, which is different from 
sexual orientation. The latter concerns who 
a person is attracted to; gender identity is 
about the body someone is born into and 
whether it matches what the brain is insist- 
ing. Either way, LGBT individuals typically 
find themselves struggling to deal with all 
this in their teens and early twenties — pre- 
cisely when science students are also sup- 
posed to be mastering their fields. Some 
respond by throwing themselves into their 
coursework. “College was a great place to dis- 
tract myself? says Underhill, who did not tell 
close friends that she was lesbian until just 
before her graduation. “I didn’t even allow 
myself to look for online sources that might 
have been helpful, because I didn’t want to 
go there” 

But the kind of emotional turmoil that Trot- 
ter describes is also very common. According 
to the US Centers for Disease Control and Pre- 
vention in Atlanta, Georgia, gay, lesbian and 
bisexual teenagers generally experience high 
levels of bullying and drug abuse, and are more 
than twice as likely to attempt suicide as their 
heterosexual peers. The lack of data means 
that there is no way of knowing how often 
this leaves promising students too stressed to 
attempt challenging science, technology, engi- 
neering or mathematics (STEM) degrees. But 
anecdotal evidence suggests that it does hap- 
pen. At the Point Foundation, says Wilstead, 
“we found out that some areas of studies, like 
law, medicine and STEM, were much more 
challenging” for scholarship recipients trying 
to maintain their grades. And that, in turn, 
may help to explain why only about 10% of the 
foundation's applicants were in STEM fields 
until five years ago, when the foundation began 
vigorous outreach efforts that helped to raise 
the fraction to around 20%. 

Adding to the loneliness and stress is the 
highly charged decision of whether to come 
out at all. To stay in the closet perpetuates the 
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In 2013, as part of a survey on global attitudes, the Pew Research Center in 
Washington DC asked people in 39 countries: "Should society accept 
homosexuality?" (It did not ask about transgender people.) 


DEGREES OF ACCEPTANCE 


Attitudes in Russia have 
shifted against gay men and 
lesbians as discriminatory 
laws have been passed in 
recent years. 


Public acceptance is 
relatively high in the 
United States. Still, 
same-sex marriages 
are illegal ina 
majority of states. 


France has seen 
recent protests against 
gay marriage as part 
of a broader rise in 
conservative views. 
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TOLERANCE AND RELIGION 

The Pew data show that tolerance tends to be low in countries with high 
‘religiosity’, a measure of how much importance people place on prayer, 
ritual and belief. Countries with low religiosity tend to have high acceptance. 
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turmoil, but the consequences of not staying 
there can be awful. After Capello came out 
as transgender at age 16, for example, his 
relationship with his family deteriorated 
to the point at which he had to leave home 
and attend a boarding school paid for by his 
grandmother. 

Jun Ding, a Stanford neuroscientist who 
grew up in Shanghai, China, before moving 
to the United States for his PhD relates a dif- 
ferent kind of experience. Although Chinese 
laws that were used to discriminate against 
gay people were repealed in 1997, there are no 
protections and very little public discourse on 
the subject. Ding says that his parents still do 
not understand when he tries to explain that 
he is now living with his husband, whom he 


married under California law. “It’s not like the 
US, where almost every movie has a gay role,” 
he says. “In China, a lot of people just don't 
have the concept of gay life” 


PEER PRESSURE 

Even if family rejection is not a concern — and 
it seems to be less common than it once was, 
says Wilstead — the decision is not necessar- 
ily any easier. More than in almost any other 
field, a researcher’s career is based on peer 
review in the widest sense, says Eric Patridge, 
a chemist at Yale University in New Haven, 
Connecticut, and president of Out in Science, 
Technology, Engineering, and Mathematics, 
a national LGBT student group. Colleagues’ 
opinions weigh heavily when it comes to 


funding, collaboration, publication, hiring, 
promotion and almost every other decision. 
Ina highly competitive environment, every 
LGBT researcher has to worry that coming 
out will trigger unconscious biases that could 
ruin his or her chances. Studies from the US 
National Institutes of Health over the past 
few years suggest that such bias may well be a 
problem for other minorities (see Nature 512, 
243; 2014), and there is no reason to think that 
LGBT researchers are exempt. 

That may be why Barres hears from many 
young LGBT scientists who are afraid to 
come out, even in the San Francisco Bay Area 
of California, historically one of the most 
tolerant regions in the United States. And 
it is worse in the more socially conservative 
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regions of the country: “When I talk to peo- 
ple from down south, the fears are so much 
stronger,’ he says. 

Those fears are justified, says Trotter. After 
he came out during his graduate studies in 
Florida, he says, some of the more religious, 
socially conservative students in his research 
group became noticeably reticent around 
him. “The scientific community is still com- 
prised of people at varying stages of social 
progress in ideas,” he says — a reality that 
he has not escaped. He is very comfortable 
at Stanford, he says. But once his postdoc 
appointment there is over, he may well have 
to apply for tenure-track jobs back in Florida 
or another less-tolerant part of the country 
— “where my ability to acquire tenure or run 
a successful research programme would be 
in question”. 

And that is just in the United States. Scien- 
tists and science students seem to be equally 
reticent in LGBT-friendly Western Europe — 
and even more so in China. In many parts of 
the Middle East and Africa, moreover, LGBT 
activity is punishable by law — in some cases, 
with execution (see Nature 509, 274-275; 
2014). “So if youre a chemist or geologist for 
an oil company, youd better be in the closet if 
they send you to one of those countries,’ says 
Rochelle Diamond, who chairs the board of 
directors of the National Organization of Gay 
and Lesbian Scientists and Technical Profes- 
sionals in Pasadena, California. 


DIFFICULT CONVERSATION 

Coming out can be even more daunting 
for transgender people — especially if the 
announcement coincides with the start of 
sex-reassignment treatments. Added to the 
emotional stress and professional concerns 
are the physical effects of interventions such 
as taking hormones. “There’s a reason youre 
supposed to go through puberty before you 
get to college,” says Capello. Kate Forbes, who 
now works for a medical information-tech- 
nology company, transitioned while she was 
earning her doctorate in ecology at the Uni- 
versity of Wisconsin-Madison; she describes 
nights curled up in pain on her futon after 
electrolysis treatments to get rid of the hair 
on her legs. 

Then there are the awkward conversations. 
“Every time Id start a course I would have to 
have a very personal discussion with the pro- 
fessor about things like male pronouns,” says 
Lucas Cheadle, a neuroscience postdoc at Har- 
vard University in Cambridge, Massachusetts. 
Making the situation doubly difficult was that 
he transitioned while he was an undergrad- 
uate at Smith College — a women’s university 
in Northampton, Massachusetts. “I missed out 
on a lot of mentorship relations because of the 
difficulty of explaining,” he says. 

Even ordinary paperwork can become a 
major burden. Newly transitioned individu- 
als can spend endless hours struggling to 


convince sceptical bureaucrats that they have 
a legitimate claim to their own university 
transcripts, publication lists, birth certifi- 
cates, driving licences, credit cards and more 
— all now in a different name. Even in toler- 
ant Western Europe, officials may demand 
extensive documentation before making 
the required changes, says Padman. “Britain 
is quite unusual in that respect,” she adds: 
the Gender Recognition Act 2004 declares 
that each individual is the gender he or she 
decides to be. “They can get a new birth cer- 
tificate with the new gender, and have all the 
legal rights of the acquired gender.” Backing 
that up is the Equality Act 2010, which for- 
bids discrimination again trans people. “It dis- 
courages the tabloids from making a huge deal 


“THE SCIENTIFIC COMMUNITY 

[S STILL COMPRISED OF PEOPLE 

AT VARYING STAGES OF SOCIAL 
PROGRESS IN IDEAS.” 


about someone’s sex reassignment on the front 
page, the way they used to,’ says Padman. 

The United States has certainly not gone 
as far towards recognition as Britain. But 
even so, the situation seems to be improving 
rapidly for younger LGBT people — not least 
because of the Internet. “When Point was 
founded 13 years ago,’ says Wilstead, “our 
scholars were saying that they couldn't find 
anyone who was older, gay and successful.” 
But today, thanks to Facebook, Twitter and 
the plethora of other social-media sites, it is 
much easier to make such connections. And 
one consequence, says Wilstead, is that LGBT 
people are coming out much earlier than they 
used to. Jack Andraka, for example, was a 
15-year-old student in Crownsville, Mary- 
land, when he won the grand prize at the 2012 
Intel International Science and Engineering 
Fair, for discovering a test for pancreatic, 
ovarian and lung cancer. He had come out as 
gay when he was 13. 

Another consequence is an increased sense 
of solidarity in the LGBT community itself. The 
group was once defined only by what its mem- 
bers are not: straight. “Aside from that,’ says 
Underhill, “my experience as a white lesbian 
woman may have little in common with that 
ofa black gay man.’ That is a point echoed by 
many others: their ‘community is often riven 
by the same fault lines as the society around 
it, with gay white men dominant, and women, 
bisexuals and ethnic minorities each feeling 
marginalized in their own ways. And only in 
the past five years or so have transgender indi- 
viduals begun to become more visible. 

But the rising generation tends to be much 
more concerned about inclusion, says Wil- 
stead. Witness the embracing of the once- 
derogatory term ‘queer’. “It’s an umbrella 
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term, he says. “It’s basically saying, ‘I am just 
different” That openness and solidarity, in 
turn, is making it much easier for young sci- 
entists to find mentors and role models. It is 
impossible to overemphasize how important 
that is, says Trotter — “seeing that it really does 
get better, seeing gay and lesbian scientists who 
have been through it and made it”. 

But the scientific establishment could give a 
lot more help in promoting role models than 
it has so far, say LGBT activists. Barres thinks 
that the US National Academy of Sciences 
missed a golden opportunity when he was 
elected to membership last year. “I asked if they 
would include in the announcement that I was 
the first transgender scientist to be elected,” 
he says. “They never did.” (Electrical engi- 
neer Lynn Conway was elected to the paral- 
lel National Academy of Engineering in 1989, 
although she did not come out as transgender 
for another decade.) 

The scientific establishment could also do 
a lot more about collecting basic data. For 
example, the US National Science Founda- 
tion, which compiles detailed statistics about 
women, under-represented minorities and 
the prevalence of various disabilities among 
US researchers and STEM students, does not 
currently ask about LGBT identification. 
Nor do there seem to have been systematic, 
large-scale studies of the social environment 
for LGBT researchers. How much stress do 
they really feel if they stay closeted in the lab? 
What are the actual effects on their health 
and productivity? And if they do come out, 
are they really less likely to be funded, hired 
or promoted? At least one team — sociolo- 
gists Erin Cech of Rice University in Hou- 
ston, Texas, and Tom Waidzunas of Temple 
University in Philadelphia, Pennsylvania — is 
hoping to carry out a survey of 2,000-3,000 
LGBT scientists and engineers, but has yet to 
get funding. 

Without such data, says Trotter, it is impos- 
sible for the funding agencies to know whether 
LGBT people are over- or under-represented 
in the research fields, whether there is a need 
for more support programmes and counselling, 
or whether they should offer special fellowships 
for young LGBT researchers in the way they 
now do for women and minorities. “We don't 
have numbers,’ says Trotter, “and that’s frustrat- 
ing for us as scientists.” 

Still, without minimizing the challenges 
that remain, older LGBT scientists stress how 
far the world has come in a remarkably short 
time. “When I’m contacted by young people,’ 
says Barres, “I always tell them that the fears 
are so much greater than the reality. And I 
always encourage them to be open, because 
they will be so much happier. If you're doing 
good science, if you're a great teacher — that’s 
what matters.” = 


M. Mitchell Waldrop is a features editor for 
Nature in Washington DC. 
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Medical School in Boston, Massachu- 

setts, an African American teenager 
was found dead just blocks away from the 
teaching hospitals. He had died of an asthma 
attack while clutching his inhaler. 

It is widely known that racial and ethnic 
minorities in the United States have higher 
rates of diseases such as asthma’ and can- 
cer’, and receive worse care’. Compared with 
white people with similar conditions, minor- 
ity individuals get fewer heart bypasses and 
influenza vaccinations. 

Less well known is the fact that many 
drugs work better in people of European 
origin than in others. One class of asthma 
drug (long-acting B2-agonists) is even 
associated with higher mortality in African 
Americans’. 

Populations of non-European descent 
are harmed because they are not studied 
as intensely, and clues that could reveal 
new aspects of disease biology are missed. 
Including diverse populations in clinical and 
biomedical research is a must, ethically and 
scientifically. Research infrastructure needs 
to be retooled accordingly. 

My mother was Mexican, an overworked 
single parent who learned English and put 
herself through university. I spent much 
of my time growing up with a Chinese 
surrogate family. My wrestling coach, an 
African American and member of the 1984 
US Olympic team, became my mentor and 
father figure. Later, in medical school, I lived 
in housing that was set up by Jewish stu- 
dents. These experiences have prompted me 
to consider health disparities across racial 
and ethnic populations, which I discuss here 
using the terms and criteria established by 
the US Centers for Disease Control and Pre- 
vention (CDC; see go.nature.com/a2euvo). 

The year that the young man in Boston 
died, my colleagues and I identified a vari- 
Common asthma drugs can work less well for children of some ethnicities. ant associated with asthma in the gene for 
interleukin 4, a cell-signalling protein that 

e e e coordinates immune and inflammatory 
NN NN responses. In our study” of 772 individu- 
| S S | g p ad 1e S als, the variant was associated with lower 


levels of lung function, leading to more 
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Effective clinical studies must consider all 
ethnicities — exclusion can endanger populations, 
says Esteban G. Burchard. 
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> severe disease in white people. Although 
black children are more likely than white 
children to be diagnosed with and die from 
asthma (see Asthma inequalities’), few black 
patients were included in the study, so we 
had insufficient statistical power to establish 
the genetic association in black people. How- 
ever, our analysis found that the variant was 
40% more common in black people. This led 
me to wonder whether some health dispari- 
ties might result from genetic differences, as 
well as social and environmental factors. 

That same year, the CDC published data 
showing that the occurrence of and deaths 
from asthma were threefold higher in His- 
panic communities in the northeastern 
United States than in those on the west 
coast. immediately thought that the obser- 
vation could result from genetic differences 
between Puerto Ricans (concentrated in the 
east of the country) and Mexicans (concen- 
trated in the west). This realization spurred 
the Genetics of Asthma in Latino Americans 
(GALA) study, which started in 1998 in Bos- 
ton, New York and San Francisco, California. 
For one analysis, children with asthma were 
asked to breathe into a measuring appara- 
tus after receiving standard treatments. The 
research® showed that the biggest predictor 
of drug response was ethnicity — stronger 
than age, sex or disease severity. Commonly 
prescribed asthma medications worked less 
well for Puerto Ricans than for Mexicans and 
African Americans. 


Such disparities “Grant 
occur across other applications 
ethnicities andcondi- should be 
tions. Heart disease regarded more 
and stroke arethetwo favourably, 
leading causes ofdeath not less, for 
worldwide, and the analysing 
blood thinner clopi- minority 


dogrel is widely pre- 
scribed to people who 
have had a heart attack or a stroke. In March, 
officials in Hawaii sued the drug's manufac- 
turer for failing to disclose that it is frequently 
ineffective in the state’s largely east Asian and 
Pacific Islander population, placing them at 
higher risk of recurrent heart attacks. 


populations.” 


VICIOUS CYCLE 

Many barriers keep studies that could charac- 
terize such disparities from being proposed, 
funded, carried out and published. The hospi- 
tal where I work runs dozens of clinical trials, 
but it serves mainly people of European and 
Asian descent. To recruit enough subjects for 
the GALA study we sent staff to other parts 
of the San Francisco Bay Area, to Mexico and 
to Puerto Rico. We established a network of 
physicians with experience serving diverse 
communities, used bilingual clinical coordi- 
nators and engaged community clinics, reli- 
gious leaders and community activists. That 
Iam a bilingual physician-scientist from an 


ASTHMA INEQUALITIES 


Genetic variants contribute to disparities in 
asthma incidence and treatment efficacy. 
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ethnic minority was invaluable for brokering 
these connections. 

Once participants were recruited, we 
measured genetic ancestry using reference 
data from the 1000 Genomes Project and the 
Human Genome Diversity Project. This was 
not easy — fewer than 4% of genetic associa- 
tion studies have been conducted in people of 
non-European descent. We had to create our 
own human genetic reference panels by team- 
ing up with another investigator who had col- 
lected samples from Native Americans. 

Our work paid off. We were able to show 
that considering genetic ancestry can 
improve the accuracy of diagnosis of lung 
disease in African American and Mexican 
populations. We have also identified genetic 
variants that might explain why asthma 
drugs work less well for Puerto Rican and 
African American children. Clinical trials 
are now under way to assess the efficacy 
of asthma medications in different ethnic 
populations, based on genetic variants. 

Such work, focused on minority popula- 
tions, faces a vicious cycle. As a reviewer for 
the US National Institutes of Health (NIH), 
which is funded by US taxpayers, I witnessed 
how grant applications that propose genetic 
analyses in minority populations in the 
United States are criticized because reviewers 
considered these populations more difficult 
to analyse than more-genetically-homoge- 
nous European populations. Sadly, I believe 
that many NIH reviewers see rich genetic 
ancestry largely as a potential confounder. 
They do not appreciate that it can be lever- 
aged to reveal new risk factors. 

Publishing such results is also difficult. 
Most high-impact journals require that 
an association be found in samples from 
two independently recruited studies. This 
demand is straightforward in European pop- 
ulations, because many banked samples exist. 
It is much harder to meet for other groups. 

Disparities are self-perpetuating. 
Minority scientists are often best placed 
to gain community ‘buy im and trust in 
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minority populations, but these scientists are 
at a disadvantage in other ways. According 
to one analysis, black scientists in the United 
States were 13% less likely to get NIH fund- 
ing than white researchers’. In short, inves- 
tigators who want to focus on minorities face 
extra challenges. 


COUNT EVERYONE IN 

The NIH Revitalization Act of 1993 mandated 
that NIH-funded research must include 
minorities. Twenty-one years later, diversity- 
focused clinical research is still the exception, 
not the rule. Although black people and other 
minorities in the United States have greater 
rates of and mortality from cancer than 
white people’, they are generally less likely 
to be enrolled in clinical trials. Of the 10,000 
clinical trials funded by the National Cancer 
Institute since 1993, only around 150 studies 
focused on racial or ethnic minorities®. 

Such gaps and their effect on health care 
must be assessed. Funding agencies should 
do more to collect evidence on what research 
is needed, promote research training, and 
provide venues for discussion of dispari- 
ties in biomedical research. At a minimum, 
the race and ethnicity of study participants 
should reflect the population with the dis- 
ease being investigated. Grant applications 
should be regarded more favourably, not 
less, for analysing minority populations. 
Journals should require appropriate repre- 
sentation and analyses before publishing 
clinical studies. 

Investigators must also form partner- 
ships with physicians and residents in 
under-represented communities — they too 
have a vested interest in improving studies. 
Finally, there must be increased recruit- 
ment of minority physicians and scientists 
and mechanisms to enhance their training 
and retention. 

At every stage of the scientific discovery 
and review process, investigators should keep 
in mind that ancestry can contribute to dif- 
ferences in disease and drug response. To do 
otherwise is to ensure worse health for us all. m 


Esteban G. Burchard is professor of 
bioengineering and therapeutic sciences, and 
of medicine, at the University of California, 
San Francisco, USA. 

e-mail: esteban. burchard@ucsf.edu 
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A researcher works in a malaria-drug laboratory in the Democratic Republic of Congo. 


Discuss inequality 


Confront economic differences to strengthen global 
research, urge P. Wenzel Geissler and Ferdinand Okwaro. 


ealth research in Africa operates 
H across vast economic and political 
inequalities. These shape collabo- 

rating scientists’ work and lives, running 
counter to ideals of equal partnerships. 
Most scientific staff from ‘the south (as cur- 
rent euphemism circumscribes impover- 
ished countries in sub-Saharan Africa) are 
employed by local institutions or on fixed- 
term contracts. Although these researchers 
have advantages over non-collaborating local 
colleagues, they have lower salaries than their 
northern counterparts and smaller allow- 
ances for health, housing, retirement or their 
children’s education. Yet when expatriate and 
local scientific workers depart after the work- 
day for opposite sides of town, which might 
retain overtones of class and colonial differ- 
ences, such inequalities go without mention. 
This is a problem. Unacknowledged ine- 
qualities cause misunderstandings and irri- 
tation. Consider young African researchers 
at an international conference, sleeping and 
eating outside the five-star venue and miss- 
ing informal scientific exchanges to save their 
allowances. To naive colleagues they might 
seem uncommitted. Or consider a southern 
principal investigator hired to run part ofa 
multisite trial, who must cover the substantial 
security, housing and school bills of expatri- 
ate colleagues within a limited local budget. 
As ethnographies of transnational research 
show, this is not simply the trope of exploited 
southern scientists and dominating northern 
ones; it is the reality of global health science’”. 
We use the term ‘unknowing’ for the 


tendency to avoid straightforward talk about 
obvious inequalities’. Unknowing can be 
about participants’ poverty or malfunctioning 
public health facilities, or be between scien- 
tific staff. Discourse is rarely silenced overtly 
— although speech about inequality is often 
limited to private chats between peers. Euphe- 
misms — including ‘partnership; ‘south’ and 
‘north — deflect attention from material real- 
ities. So do practical arrangements: discussing 
or comparing salaries openly would reveal the 
systemic challenge of addressing radically dif- 
ferent pay rates, so researchers instead negoti- 
ate individually for ‘reimbursement or ‘per 
diems for field days. 

The subject is raw. Calling out inequal- 
ity reveals our limited ability to do the right 
thing. Northern colleagues in particular 
sometimes feel that such talk is unfair. As 
progressive researchers working to improve 
others’ lives, they resent comparisons, even 
unintended, with exploitative colonialists*. 

In large collaborative sites, open discussion 
threatens established routines and substantial 
investments. When disparities seem intracta- 
ble, talking about them can feel paternalistic, 
rude, whiny, self-righteous — or pointless. Yet 
unknowing also subdues essential scientific 
discussion. The ‘normal emergency’ state 
of African health systems often requires 
research into context-specific tools — adapted 
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reference values for nutrition or toxicity, 
surgical-safety procedures in understaffed 
theatres and diagnostic procedures substi- 
tuting for standard tests’. Polite reluctance to 
acknowledge actual health-care conditions, 
insisting instead on universal standards 
and research themes, can produce ‘world- 
class’ research that fails to address real-world 
concerns and opportunities. In turn, this can 
instil a sense of futility in local scientists®. 

Divergent interests and needs, if ignored, 
find destructive outlets — such as workplace 
talk of racism or colonialism, or conflicts 
fought under the guise of ethical review 
procedures or even outright legal battles. In 
short, unexpressed ‘us and then’ thinking 
erodes research quality. Frank discussion 
about money, opportunity and leadership 
is not, as it sometimes is denigrated, just 
‘politics’ It is essential. 


IT’S GOOD TO TALK 
That said, articulating inequalities between 
unequal parties is difficult. Such conversa- 
tions may be launched by those who seek 
redress, but it should also be initiated by 
leaders or with the aid of social scientists. 
This year, we presented fictional cases 
about everyday research ethics and inequal- 
ity in a number of sites in east Africa and in 
European collaborating institutions (see 
africanbiosciences.wordpress.com). Most 
participants — especially local collaborators 
— appreciated the chance to acknowledge 
inequalities publicly. One presentation initi- 
ated the first open discussion about external 
research in a hospital that had collaborated 
with a European partner for decades; it 
yielded suggestions for improved commu- 
nication and requests for specific diagnostic 
resources. Even after inconclusive arguments 
or intense conflicts about the ‘realisn’ of man- 
agers and ‘radical’ demands by junior staff, 
participants remarked how good it was to talk. 
Collaboration under conditions of 
inequality is not comfortable. Talking about 
these realities can foster shared goals, more- 
equal relationships and better science. m 


P. Wenzel Geissler is professor of social 
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Division of Social Anthropology, University 
of Cambridge, UK. Ferdinand Okwaro is 
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Studies, University of Cambridge, UK. 
e-mails: p.w.geissler@sai.uio.no; 
fokwaro@gmail.com 
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COMMENT 


Tailor informed- 
consent processes 


The first step in studying mental-health interventions across cultures is 
to adjust procedures to participants’ needs, says Monica Ruiz-Casares. 


he enduring mental-health con- 

sequences of armed conflict, natural 

disasters and forced migration are 
increasingly recognized. But clinicians and 
the people they help often come from dif- 
ferent backgrounds, each unfamiliar to the 
other’. This can result in tensions, inappropri- 
ate health services and misleading research. 

It is hard to assess how well existing 
scientific evidence for effective mental-health 
care applies to any group under-represented 
in mental-health research. That evidence is 
based mainly on adult, Western populations. 
Refugees, migrants and young people from 
other cultures are much less studied; conduct- 
ing the necessary research can raise thorny 
ethical issues. Investigators and their insti- 
tutional review boards must be flexible and 
patient so that this work can progress. 

Informed, reasoned and voluntary consent 
is core to the ethical conduct of research, but 
the norms vary across cultures’. Even the 
standard practice of explaining a research 
protocol to study participants and then 
obtaining signed consent can raise complex 
issues. Some communities value verbal agree- 
ments more than written contracts, which 
they might view with suspicion. The very act 
of requesting signatures could create mis- 
trust and the misperception that participants 
are entering into binding agreements that 
they will not be able to withdraw from. In 
places with repressive political regimes or for 
undocumented immigrants, signed consent 
could put participants in danger. Women who 
make decisions about their own or their chil- 
dren’s participation in research without their 
partner’s consent could face serious family 
conflict or even violence. 

Alternative procedures tailor information 
and consent to participants’ needs. The basis 
of robust informed consent is respect — for 
dignity, intercultural dialogue, equality and 
solidarity. Researchers must find more ways 
to present information clearly, particularly 
to non-literate participants. Some investiga- 
tors use colourful visual aids or multimedia. 
Weare developing visual informed-consent 
forms with children in Canada’ and Cam- 
eroon in which potential research subjects 
take photographs to represent ethical issues. 
Children have, for example, photographed 
a lock or someone’s mouth with a finger 


Community gatherings can be part of an 
informed-consent process. 


crossing it to represent confidentiality. 

Researchers must also think about poten- 
tial consequences that a research project 
might have on participants and their com- 
munity. This requires time to build trusting 
relationships and to engage communities 
and local co-investigators. Focused discus- 
sion groups, local advisory committees and 
participants themselves can share oversight 
of the appropriateness of study design and 
implementation. Schedules must allow for 
presentations to multiple stakeholders and 
allocate time for participants to reflect on 
the study and discuss it with others before 
deciding to participate. 

Local beliefs about who has decision- 
making authority must be considered. In 
‘collectivistic cultures, which emphasize the 
needs of groups over individuals, informed 
consent operates through relationships. 
Other community members act as consult- 
ants or permission granters*. Physicians, 
elders, senior males or mothers-in-law might 
be assumed to know best. In western Kenya, 
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community assemblies known as mabaraza 
were consulted in a long-term study of 
children separated from their parents or 
orphaned, and they recommended commu- 
nity decision-making in the consent process 
for biomedical and behavioural research’. 

Informed consent at the community level 
can conflict with Western standards for 
voluntary individual decisions. In a child- 
protection study that I led in Laos, we first 
obtained explicit permission from provincial, 
district and village authorities, deemed nec- 
essary and sufficient by local standards. But 
our own principles and those of our institu- 
tion also made it essential to gain individual 
informed consent from parents and assent 
from children. We learned to respect indi- 
viduals’ silences and left unstructured time 
so that people could depart before group 
discussions. In this way, people could decline 
participation without being singled out by the 
community in socially damaging ways. 

Assessing voluntary involvement is 
essential to produce valid research. Orphan 
children participating in group discussions 
as part of another child-protection study in 
Liberia all signed consent forms, yet we later 
realized that the children had been pressured 
to participate and told what to say by the 
administrators of the orphanages. 

The study of mental health across cul- 
tures requires creativity, tenacity and time. 
But these investments must be made. There 
is a dearth of cross-cultural mental-health 
research to inform practice. Ultimately, 
collaborative partnerships make for better 
research and boost the utility of findings. To 
relieve suffering, we must learn what works 
best in each setting, and we must ethically 
build the capacity to do so. = 
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PETER BARKER/PANOS 


Strength in 


diversity 


Richard B. Freeman and Wei Huang reflect on a link 
between a team’s ethnic mix and highly cited papers. 


ticking with co-authors with similar 

surnames to yours might dent the 

impact of your work. The reason is 
unclear, but bibliometrics suggest that 
teams with greater ethnic diversity gener- 
ate papers that make more ofa splash in the 
scientific literature. 

We analysed’ 2.5 million research papers 
in which all of the authors had US addresses. 
Our study showed that US-based authors 
with English surnames were more likely 
have co-authors with English surnames than 
would occur by chance; those with Chinese 
names were more likely to have co-authors 
with Chinese names, and so on. The trend 
held for seven other groups, including Rus- 
sian and Korean populations, between 1985 
and 2008 in 11 scientific fields, including bio- 
medicine, physics and geosciences. 

The results hint that scientific research is 
much like the rest of social life. Studies of 
social networks find that people eat with, 
work with and generally connect with others 
similar to themselves, a tendency that some 
sociologists call homophily. 

To the extent that surnames can be a proxy 
for ethnicity, homophily in scientific collab- 
orations also seems to be related to a work's 
reception in the scientific community. After 
controlling for numbers of authors and for 
factors such as an ethnic groups’ population 
density, we find that greater ethnic homo- 
geneity among authors is associated with a 


paper's publication in lower-impact journals. 
It also predicts fewer citations. Papers with 
four or five authors of multiple ethnicities 
have, on average, one to two more citations 
than those written by authors all of the same 
ethnicity. This effect represents a 5-10% dif- 
ference in the mean number of citations for a 
given publication. 

What might explain this observation? 
Scientists with lacklustre or fewer papers may 
have a narrower pool of potential collabora- 
tors. Homophily is greater for authors with 
weaker publication records. But even when 
we compare work from authors with similar 
publication histories, homophily is still asso- 
ciated with lower-impact papers. 


NETWORK EFFECTS 

Teasing out the implications of these find- 
ings is difficult. Teams with members from 
diverse ethnic backgrounds may benefit from 
a greater variety of perspectives. Researchers 
have been shown to think differently when 
they work in diverse groups because they 
expect greater challenges to their ideas, or 
because small group dynamics are altered’. 
Given that communication can be hampered 
by linguistic or cultural differences’, perhaps 
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researchers make an extra effort to work 
through these challenges on research ques- 
tions that are likely to have particular impact. 

Network effects offer a different sort 

of explanation. A paper generated bya 
more diverse research group could tap 
into different networks and thus attract 
greater attention and citations, an effect 
observed in patents studies*, and in 
inter-institution and international col- 
laborations®. And although using jour- 
nal impact factors to infer the quality 
of individual papers is controversial, 
that relationship, too, indicates that 
diverse teams publish stronger papers. 
In other words, greater diversity of author- 
ship might boost either the quality of the 
paper or the number of people who notice 
it, or both. One way to distinguish between 
the two would be to examine the terms, 
techniques and references in papers. If eth- 
nic diversity contributes to productivity by 
widening ideas, papers from more-diverse 
collaborations should contain a wider range 
of scientific terms, use more varied equip- 
ment, procedures, or data and reference a 
wider range of previous work than papers 
from homogenous groups. In the biomedi- 
cal sciences, the medical subject heading, or 
MeSH, terms would provide a natural meas- 
ure, as might automated text analysis. 

Another approach would be to model 
the probable impact of network effects on 
citations and then estimate the effect of co- 
authors in differently sized ethnic networks. 
This type of analysis could also be used to 
determine the mechanism by which inter- 
institutional or international collaborations 
often have greater impact than collaborations 
written at a single address. 

Finally, we are studying the ethnic mix of 
collaborators who met at scientific meet- 
ings, and the impact of resulting papers. This 
would control for variation in the opportu- 
nity to meet people of different ethnicity, and 
could isolate people’s preference for homoph- 
ily or diversity. 

These are questions worth pursuing. 
We need to work out what makes the most 
creative and effective scientific teams. = 
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WALKER AND WALKER/GETTY 


AUTUMN BOOKS 


Under the skin 


Nathaniel Comfort wonders at the enduring trend of misrepresenting race. 


published this year argue the question. 
All miss the point. 

Michael Yudell’s Race Unmasked and 
Robert Sussman’s The Myth of Race can be 
read as inadvertent retorts to former New 
York Times journalist Nicholas Wade’s A 
Troublesome Inheritance, published while the 
former were in the press. Wade's book is by far 
the most insidious, but all three are polemics 
that become mired in proving (in Wade’ case) 
or disproving (in the others’) whether race is 
biological and therefore ‘real: This question is 
a dead end, a distraction from what is really 


I s race biologically real? A clutch of books 


A Troublesome Inheritance: Genes, Race 
and Human History 

NICHOLAS WADE 

Penguin: 2014. 


Race Unmasked: Biology and Race in the 
20th Century 

MICHAEL YUDELL 

Columbia University Press: 2014. 


The Myth of Race: The Troubling Persistence 
of an Unscientific Idea 

ROBERT WALD SUSSMAN 

Harvard University Press: 2014. 
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at stake in this debate: human social equality. 

Race is certainly real — ask any African 
American. It originated long before the 
science of genetics, as sets of phenotypes 
and stereotypes. These correlate with haplo- 
types, clusters of genetic variation. In this 
sense, race is genetically ‘real. But those cor- 
relations depend on judgement calls. Wade 
cites population-genetics studies that identify 
three principal races: caucasian, African and 
East Asian. Elsewhere he cites five, adding 
Australasian and Native American; or seven, 
splitting caucasians into people from Europe, 
the Middle East and the Indian subcontinent. 


ILLUSTRATIONS BY DARREN HOPES. 


A study in Scientific Reports this year iden- 
tified 19 “ancestral components’, including 
Mozabites, Kalash and Uygurs (D. Shriner 
et al. Sci. Rep. 4, 6055; 2014). Palaeogeneticist 
Svante Paabo and others have revealed the 
underlying human genetic variation to be 
a series of gradients. Whether and how one 
parses that variation depends on one’s train- 
ing, inclination and acculturation. So: race 
is real and race is genetic, but that does not 
mean that race is ‘really’ genetic. 

The completion of the draft human- 
genome sequence in 2000 led some opti- 
mists to forecast the end of race (one of them, 
Craig Venter, wrote the foreword to Yudell’s 
book), but use of the term in the biomedical 
literature has actually increased since then. 
For clinicians, race is a matter of pragma- 
tism. Although each of us is genetically and 
epigenetically unique, our ancestry leaves 
footprints in our genomes. Consequently, 
clinicians use familiar racial categories such 
as ‘black’ or ‘Ashkenazi Jewish’ as crude mark- 
ers of genotypes, in a step towards individual- 
ized medicine. For them, the reality of race is 
immaterial; diagnosis and treatment are what 
count (see page 301). 

Debates over the genetic reality of race, 
then, are not mainly scientific, but social. 
They deploy the cultural authority of science 
— considered society's most objective way of 
understanding the world — as a fig leaf for 
positions motivated explicitly or implicitly by 
ideology. All three of these books argue that 
if the proof or disproof of race is scientific, it 
must be true. The author must be right. More 
importantly, his opponents must be wrong. 

For Wade, science proves that race is 
genetic. Much like Richard Herrnstein and 
Charles Murray's The Bell Curve (Free Press, 
1994), his book moves smoothly through 
seemingly reasonable arguments that humans 
are still evolving, to end up at the retrograde 
conclusion that Europeans have become the 
world’s richest and most powerful people 
mainly because they are genetically the most 
open, curious, innovative and hard-working. 
Also like The Bell Curve, Wade's book draws 
heavily on a long tradition of what histori- 
ans refer to as scientific racism, particularly 
research connected to the Pioneer Fund, 
chartered in 1937 in part to “support study 
and research into the problems of hered- 
ity and eugenics” and, as Sussman shows, 
still deeply involved in eugenic and racial 
research. Despite such transparently politi- 
cal sources, Wade insists that his argument is 
based on ideology-free science. On 8 August, 
139 population geneticists — including sev- 
eral on whose work Wade based his argu- 
ments — signed a letter to The New York 
Times declaiming his use of their results. Now 
that those whose work he once categorized as 
“scientific” instead of “ideological” have come 
out against the book, Wade has denounced 
them, too, as being motivated by politics. 


By contrast, the ‘race realist’ and ‘human 
biodiversity (HBD) groups are delighted 
with Wade’s book. For example, Jared 
Taylor, editor of the HBD magazine Ameri- 
can Renaissance, applauds Wade's argument 
that (in Taylor’s paraphrase), “foreign aid is 
probably wasted because poor countries are 
not genetically prepared for the institutions 
necessary for wealth”. Other pillars of the 
race-realist movement, such as the website 


DEBATES OVER THE 


GENETIC 
REALITY 


OF RACE ARE NOT 


MAINLY 
SCIENTIFIC 


BUT 


SOCIAL. 


Stormfront and the writer John Derbyshire, 
gave Wade's book glowing reviews. 

For both Yudell, a historian of public health, 
and Sussman, a cultural anthropologist, sci- 
ence proves that race is cultural. In making 
this case, both devote considerable space to 
eugenics, the science and social movement 
concerned with human hereditary improve- 
ment. The eugenics movement — particularly 
in the United States in the early twentieth 
century and in Nazi Germany — offers a 
cornucopia of evidence of scientific racism. 
But, in focusing on the US movement’s most 
egregious leaders, such as Charles Davenport, 
Madison Grant and Henry Fairfield Osborn, 
both Yudell and Sussman over-simplify. 
Eugenics was about much more than just 
race. Recent scholarship has documented the 
pervasiveness and adaptability of the US and 
UK eugenic creed and the complicated ways 
it mingled with race, public health and femi- 
nism. Sussman and Yudell both, for exam- 
ple, discuss the efforts of the distinguished 
black leader W. E. B. Du Bois against white- 
supremacist eugenics in the early twentieth 
century. But both fail to mention his concern 
over black “dysgenics” and his eugenically 
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inflected “talented tenth” campaign, which 
sought to identify the “best of this [black] 
race’. Not all eugenics is racist, and most rac- 
ism is not — or not principally — eugenic. 

For both Yudell and Sussman, the anti- 
dote to eugenic hereditarianism was cultural 
anthropology, developed by Franz Boas and 
his students in the late nineteenth and early 
twentieth centuries. Boas coined the word 
culture in its modern sense, and became per- 
haps the greatest opponent of the biological 
concept of race. He and his students studied 
human societies through an entirely cultural 
definition of human difference. Boas found, 
for example, that cranial characteristics that 
had been claimed to be innately racial were 
the result of differences in nutrition and 
overall health. Sussman and Yudell insist that 
Boasian anthropology scientifically proved 
that race is not genetic. 

Their arguments then diverge. Sussman 
becomes, if possible, more polemical, 
whereas Yudell grows slowly less so. The 
former returns to the history of scientific 
racism, providing a passionate account of the 
continuing influence of the Pioneer Fund. 
Yudell’s late chapters, by contrast, trace the 
struggle to strip racism from race science. 

Yudell offers a rich analysis of the state- 
ments on race by the United Nations Educa- 
tional, Scientific and Cultural Organization 
(UNESCO) — a string of contentious multi- 
disciplinary reports that sought to document 
scientific knowledge on race while denounc- 
ing racism, starting in 1950. The project’s 
myriad authors split into two factions. 
One, led by Boas’s student Ashley Montagu, 
wanted to call race a fiction, a product of cul- 
ture. The other insisted that genetics showed 
that race was real. Theodosius Dobzhansky, 
a brilliant population geneticist, was intel- 
lectually invested in the genetic concept of 
race, yet morally invested in anti-racism. 
“Dobzhansky’s paradox’, in Yudell’s phrase, 
was how to save biological race theory with- 
out sounding racist. He never did — and nor 
have we, Yudell concludes poignantly. 

A full-throated, intellectually rigorous anti- 
racism must critically assess both biological 
and cultural evidence about race. It must 
acknowledge that no work on race science 
can be free of ideology — and, precisely for 
that reason, it must not place historical actors 
before a moral green screen showing an image 
of contemporary values. Rather, it must set 
the stage for each scene with meticulous, 
empathetic historical detail. Such work would 
allow the scientific study of ‘racial superior- 
ity — inherently grounded in subjectivity and 
bias — to fall on its own sword. m 
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In thrall to uncertainty 


A history of how quantum theory has permeated Western culture refreshes Jim Baggott. 


uantum theory is the most accu- 

rate and precise description of the 

molecular, atomic, sub-atomic and 
sub-nuclear realms ever devised. It is also 
utterly exasperating. To anyone tutored 
in the language and the logic of classical 
physics, it is mathematically challenging, 
maddeningly bizarre and breathtakingly 
beautiful. As charismatic US physicist Rich- 
ard Feynman warned: “Nobody understands 
quantum mechanics.” 

Given its recondite nature, quantum 
weirdness has entered popular culture in 
remarkable ways. What might otherwise 
have been the preserve of dry academic 
texts and stuffy scientific conferences has 
become common currency. Who hasn't 
heard of Heisenberg’s uncertainty princi- 
ple or Schrédinger’s cat? Quantum ideas of 
space, time and matter inspired British art- 
ist Anthony Gormley’s vast, enigmatic steel 
sculpture Quantum Cloud near London’s 


NEWIN 
PAPERBACK 


Highlights of this 
season’s releases 


The Quantum Moment: 
How Planck, Bohr, 
Einstein, and Heisenberg 
Taught Us to Love 
Uncertainty 

ROBERT P. CREASE 

AND ALFRED SCHARFF 
GOLDHABER 

W. W. Norton: 2014. 


O2 arena. And UK-based dramatist Tom 
Stoppard’s 1988 play Hapgood blends the 
duality of the double agent with a quantum 
duality in which matter and light are both 
waves and particles. Both of these works are 
cited in The Quantum Moment, in which 
philosopher Robert Crease and physicist 
Alfred Goldhaber explore quantum theory’s 
enduring cultural impact. 

Based on a class that the authors have 
taught for six years at Stony Brook Univer- 
sity, New York, the book asks why quan- 
tum theory carries such a metaphorical 


Alan Lightman (Vintage, 2014) 


detaching itself from reality. 
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punch — “wild and mysterious, packed 
with creative force” — and why it seems to 
be rediscovered in every generation. The 
authors’ tale is structured along approxi- 
mately linear historical lines, from Max 
Planck's discovery in 1900 that light can be 
described in terms of discrete ‘bundles’ of 
energy (quanta) to Bell’s theorem, which 
opened the door from the 1960s onwards 
to some extraordinary experimental tests of 
the nature of our physical reality. Each chap- 
ter explores how some of the core ideas and 
principles that sprang from these historical 
moments have been absorbed into (inevita- 
bly mostly US) popular culture. This makes 
for an entertaining read, even for those 
already familiar with the story. 

As Crease and Goldhaber explain, much 
of the impact on modern culture derives 
from what quantum theory has to say about 
the nature of reality. Arguably, centuries of 
observation, experimentation and strenu- 
ous intellectual endeavour were founded 


Physicist and literary wizard Alan Lightman reflects on how our cosmos, potentially 
one among uncountable others, has fortuitously created the perfect conditions for life. 
He considers intricate symmetries in nature and the unfathomable vastness of space. 
This journey through seven overlapping ‘universes’ — frameworks for exploring 
recent research — culminates in a vision of humanity hooked on technology, gradually 


on scientists’ expectation that the material 
world is composed of some kind of funda- 
mental atoms of ‘stuff. Quantum theory, 
however, has rewarded these endeavours 
with phantom particles that, like waves, 
can be both here and there; a theoretical 
structure that tells us only what might hap- 
pen (not what will); and quantum systems 
seemingly connected over great distances, 
giving rise to extended, non-local effects, or 
what Albert Einstein called “spooky action 
at a distance”. 

Einstein famously rejected the element 
of chance that lies at the heart of quantum 
theory, declaring that God does not play 
dice. He argued that quantum theory is not 
complete. Those scientists who, like Ein- 
stein, are less inclined to accept that we have 
reached an ultimate limit of what is know- 
able remain firmly in denial. So, in the past 
40 years or so, the efforts of agents provo- 
cateurs such as John Bell and Tony Leggett 
have encouraged an orgy of sophisticated 
laser-based experiments to test the foun- 
dations of quantum physics — what I have 


elsewhere called “experimental philosophy”. 
It is this work that has prompted the current 
interest in quantum cryptography, quantum 
computing and the teleportation of photons. 

Ihave only one quibble with The Quan- 
tum Moment. Crease and Goldhaber sup- 
port their narrative with ‘interludes’ after 
each chapter, designed to provide techni- 
cal details, including some equations. The 
exposition here is a little drier than in the 
main chapters, but does not need to be. The 


material also necessarily repeats much of 


what has already been covered, which can 
become a little tedious. The authors suggest 
that readers might prefer to skip these inter- 
ludes; for linear readers like me, that does 
not really work. 

Those versed in quantum theory’s 
practical applications might be tempted 
to dismiss its many manifestations in 
popular culture as what the authors call 
“fruitloopery”. And certainly, there is a 
lot of nonsense out there. But, as Crease 
and Goldhaber make abundantly clear at 
several points, many esteemed physicists 
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(who should probably know better) have 
been more than willing to indulge their 
inner metaphysician in research papers and 
popularizations on the mistaken principle 
that, as the Canadian philosopher Marshall 
McLuhan once put it, “mud sometimes 
gives the illusion of depth” 

Thus we smile at the comical pronounce- 
ments on physics by US actress Shirley 
MacLaine, until the authors point out that 
she could be paraphrasing similar pro- 
nouncements made 55 years earlier by the 
British physicist James Jeans. I have else- 
where argued that contemporary theoreti- 
cal physics has become rather self-indulgent 
and self-referential, a malaise that I have 
called fairy-tale physics. Deep questions 
about the nature of reality tend to provoke 
this kind of response, and it surely finds its 
origin in the quantum moment. = 


Jim Baggott is the author of The Quantum 
Story and Farewell to Reality. He is based in 
Reading, UK. 
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LINGUISTICS 


The write stuff 


Steven Pinker’s provocative treatise on language use and 
abuse would benefit from more data, finds Paul Raeburn. 


o conversation about the science 
N:« language can get very far with- 

out a mention of Steven Pinker, the 
Harvard University cognitive scientist who 
has not yet made linguistics as popular as 
football — but is working on it. In The Sense 
of Style, he wants to give us the cognitive 
science, linguistics and psychology behind 
classic debates over proper English, from 
passive voice to split infinitives. 

Plenty of others have given us stuffy decrees 
intended to end the interminable wrangling, 
but Pinker is different. He is unhappy with 
the classic style manuals — including revered 
texts such as Strunk & White (William Strunk 
and E. B. White's The Elements of Style) or 


Fowler’ Modern English Usage. We need a 
new guide “infused by the spirit of scientific 
skepticism’, he writes, using grammar and 
research on “the mental dynamics of reading” 
to replace edicts with evidence. Pinker gave us 
the science in The Language Instinct (William 
Morrow, 1994); in The Sense of Style he sets 
out to offer its practical application. 

He covers much of the same ground as 
the classic guides, including frequently mis- 
used words (“fulsome” and “noisome”) and 
the serial comma. His problem with Strunk 
e& White, however, is that the authors lack 
tools for analysing language, and so end up 
“vainly appealing to the writer's ‘ear”. That’s 
on page two. By page three, he is challenging 


— the manual’s dismissal 

oe of the passive voice. 

Linguistic research, 

he later writes, has 

shown that the pas- 

sive actually “allows 

the writer to direct the 

reader's gaze, like a cin- 

: ematographer choos- 

The Sense of Style: ing the best camera 

The Thinking angle”. What research, 
Person’s Guide to tiv? Pinker d 

Writinginthe 21st ‘*@CUY* Finker does 

Century not tell us. His views 

STEVEN PINKER are informed by 

Allen Lane: 2014. psycholinguistics; that 

is his day job. But he 

promises us science, so I expected to see data. 

However, in this instance, and in many others, 

the data are not there. 

Similarly, Pinker’s view on infinitives is 
to split them “if you need to’, a conclusion 
backed by dictionaries and style manuals — 
not research. And when he quotes with admi- 
ration the opening line of Richard Dawkins’ 
Unweaving the Rainbow (Houghton Mifflin, 
1998) — “We are going to die, and that > 


y 


Love and Math: The Heart of Hidden Reality 
Edward Frenkel (Basic Books, 2014) 

With infinite passion, media-feted professor 
Edward Frenkel shares his rise to mathematical 


greatness against a tide of Russian anti-Semitism. 
Appeasing maths-haters, he uses a borscht recipe 


to explain quantum duality. (See Marcus du 
Sautoy’s review: Nature 502, 36; 2013.) 


Life at the Speed of Light 

J. Craig Venter (Abacus, 2014) 

Biologist J. Craig Venter shares his life’s work of 
catalysing progress in biological engineering, 
sequencing the human genome and ultimately 
creating the first “synthetic cell” (Mycoplasma 
mycoides JCVI-syn1.0). (See Nathaniel Comfort’s 
review: Nature 502, 436-437; 2013.) 
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> makes us the lucky ones” — he offers a 
detailed explanation of why it works that 
is, again, short on science. 

Pinker is a good writer and a deeply 
humanistic one, and there are many 
bright moments here. His lists explain- 
ing right and wrong usage with a range 
of examples (enervate means to sap, not 
energize) are a useful desk reference. 
Among numerous good tips is one on, 
as Pinker has it, “the compulsion to name 
things with different words when they 
are mentioned multiple times” “Herons 
are herons,” he writes, not “long-legged 
waders, azure airborne aviators, or sap- 
phire sentinels of the sky”. 

At times, however, Pinker’s own writing 
verges on the incomprehensible. Consider 
his critique of this sentence: “Toni Mor- 
rison’s genius enables her to create novels 
that arise from and express the injustices 
African Americans have endured” Some 
might say ‘her is an error, because an 
adjective (“Toni Morrisons’) cannot be 
the antecedent of a pronoun. But Pinker 
explains it this way: “Toni Morrison’ is 
not an adjective, like red or beautiful; it’s 
a noun phrase in genitive case. (How do 
we know? Because you can't use genitives 
in clear adjectival contexts like That child 
seems Lisa’ or Hand me the red and Johns 
sweater.)” After reading that several times, 
I think I know what he means. But it is 
tough to get through. 

Pinker also reveals himself at the outset 
to be nota prescriptivist, like Strunk and 
White, but a descriptivist, who sees lan- 
guage as “a wiki that pools the contribu- 
tions of millions of writers and speakers”. 

I agree: we make the language. But if 
that is the case, science probably can't do 
any better than Strunk & White at dictat- 
ing style. The only legitimate data come 
from the people. So maybe it is too soon 
to jettison the classic style manuals: I 
suspect much of Pinker’s sense of style 
comes less from his science than from his 
own wonderful writer’s ear. m 


Paul Raeburn is the author of four 
books and the chief media critic for the 
Massachusetts Institute of Technology's 
Knight Science Journalism Tracker. 
e-mail: paulraeburn@nasw.org 


EVOLUTION 


Tribes like us 


Tim Lenton is intrigued by E. O. Wilson’s sweeping 
perspective on humanity’s past — and possible futures. 


hat of that ultimate existential 
question, the meaning of life? 
Aristotle saw it as the achieve- 


ment of happiness. UK comedy troupe 
Monty Python suggested that it involves 
reading “a good book every now and then’. 
In The Meaning of Human Existence, biolo- 
gist E. O. Wilson offers a good book that 
adds to such prescriptions, but readers seek- 
ing a sense of purpose will be disappointed. 
What Wilson is after is really a deeper under- 
standing of human existence. 

Still, there can be few better guides 
through our species’ past journey and 
potential for the future. Wilson provides the 
literary equivalent ofa greatest-hits album, 
giving us a pithy synthesis of his formidable 
body of work from Sociobiology (Harvard 
University Press, 1975) to The Social Con- 
quest of Earth (Liveright, 2012), with a lib- 
eral dose of Consilience (Little, Brown, 1998). 
The result is a provocative and beautifully 
written collection of essays, although one 
that struggles to be more than the sum of 
its parts. 

In the opening section, Wilson intro- 
duces his central premise that humans, like 
his beloved ants, are eusocial animals. Some 
individuals reduce their own lifetime repro- 
ductive potential so that they can raise the 
offspring of others (think of grandmothers 
after menopause). Key to the origin of euso- 
ciality is the creation of a nest, from which 
some of the population undertake risky for- 
aging while the remainder stay safe at home. 
Wilson argues that our unique intelligence 
began to evolve when our ancestors tamed 
fire to cook, settled around the campsite and 
senta fraction of the group off to risk life and 
limb hunting down energy-rich meat. 

Thus began a tension between acting 
for ourselves and acting for our group, 
which Wilson argues is at the heart of our 


conflicted human 
nature. Here he parts 
company with most 
evolutionary theorists, 
revisiting an already 
acrimonious debate 
(aired in Nature) over 
the origin of euso- 
cial traits. Wilson 
originally supported 
evolutionary biolo- 
gist W. D. Hamilton's 
theory of inclusive 
fitness, in which the 
costs of altruism can be rationalized ifthey 
are outweighed by the product of the ben- 
efits to recipients and the recipients’ relat- 
edness to the altruist. But in 2010, he and 
some colleagues rejected it (M. A. Nowak 
et al. Nature 466, 1057-1062; 2010). In its 
place, they argued for a mixture of individual 
and group-level selection. 

Back from the firmly prodded ants’ nest 
of evolutionary theorists came a predictably 
forceful defence (see, for example, P. Abbot 
et al. Nature 471, E1—-E4; 2011), but Wilson 
remains unmoved by this stinging riposte. 
The frustration for the neutral reader is that 
both sides agree that the gene is the funda- 
mental unit of selection, so the squabble is 
over different flavours of standard evolu- 
tionary theory. Neither side seems to see 
the Pythonesque irony of fighting over how 
to understand cooperation. Still, nothing 
could better demonstrate the tribal nature 
of humanity, which provides a focus for the 
rest of the book. 

Wilson’s enthusiasm for a mixture of 
individual and group-level selection goes 
further, as he struggles to resist an “over- 
simplistic” portrayal that “individual selec- 
tion promoted sin, while group selection 
promoted virtue”. The inconsistency in this 


MEANING 


The Meaning of 
Human Existence 
EDWARD O. WILSON 
Liveright: 2014. 


The Compatibility Gene 

Daniel M. Davis (Penguin, 2014) 

At the heart of our immunological-response 
systems lie ‘compatibility genes’, which 


determine each body’s capacity to fight diseases 


or accept medication. Immunologist Daniel 
Davis explores these genes’ roles in successful 
skin grafts, ill-fated pregnancies and more. 
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Brave Genius 

Sean B. Carroll (Broadway, 2014) 

Against the tumult of the Second World War, 
biologist Sean Carroll tells the interwoven stories 
of philosopher Albert Camus and geneticist 
Jaques Monod, friends who worked for the 
French resistance and won Nobel prizes. (See Jan 
Witkowski’s review: Nature 501, 487-488; 2013.) 


is soon exposed when he argues that religion 
has been crucial in reinforcing group-level 
tribalism, but is a collective sin that human- 
ity needs to grow out of. One wonders what 
the publishers were thinking when they put 
on the dust jacket the promise of Wilson 
addressing “our greatest moral dilemma 
since God stayed the hand of Abraham’, 
given that inside, he decries belief in God 
with Dawkinsian fervour. 

Yet Wilson's route to species self-knowl- 
edge is rather omniscient, because it involves 
comparing ourselves to other known or 
imagined life forms, be they ants or aliens. 
As we decimate biodiversity, leaving our- 
selves lost in an Age of Loneliness — the 
‘Eremocene’ — Wilson looks skyward for 
salvation. He is excited about exoplanets and 
brimming with existential confidence that 


The Joy of Pain 


Richard H. Smith (Oxford Univ. Press, 2014) 
Psychologist Richard Smith explores the roots of 
Schadenfreude (joy in others’ pain) in society, from 
reality television thriving on public humiliation 

to cases of envy-incited crimes, including Nazi 
persecution of Jewish people. (See Dan Jones’ 
“ihe , review: Nature 500, 147; 2013.) 


we are not alone in the Universe, offering a 
very anthropomorphic “portrait of E.T”. 
Wilson’s imaginary aliens are, I think, 
really his prescription for humanity’s 
future. They have chosen not to supplement 
their intelligence or engineer their genet- 
ics, because their technological creations 
have long surpassed them physically and 
intellectually. They are not so foolish as to 
attempt interstellar travel — were it possible 
— because they have worked out that invad- 
ing an independently evolved world would 
be a biological train wreck. Instead, they 
have made peace with their home planet 
and achieved a long-term sustainable state. 
Back in the here and now on Earth, Wil- 
son argues that we should relish our evolu- 
tionary legacy of internal conflict and the 
creativity it sparks. He sees a short future 


AUTUMN BOOKS Quinny 


for scientific progress but a long one for the 
humanities, arts and social sciences. Sur- 
prisingly optimistic that brain-activity map- 
ping is going to solve the riddle of human 
consciousness sooner rather than later, 
Wilson feels that we will be left clutching 
the sensation of free will, which he thinks is 
just an adaptation necessary for our sanity. 
If the resulting nihilism does not lead us to 
despair, the way forward will be to unify the 
sciences and humanities to reach a higher 
state of human “meaning”. Anyone fancy 
the ride? = 


Tim Lenton is professor of Earth system 
science at the University of Exeter, UK, 
and co-author with Andrew Watson of 
Revolutions That Made the Earth. 
e-mail: t.m.lenton@exeter.ac.uk 


Dosed: The Medication Generation Grows Up 
Kaitlin Bell Barnett (Beacon, 2014) 

Journalist Kaitlin Bell Barnett, herself medicated 
in youth, tells the stories of five people who from 
childhood have been treated with psychotropic 
drugs for conditions such as depression and 
attention-deficit hyperactivity disorder, addressing 
their sense of lost freedom and identity. 
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CLIMATE POLICY 


A societal sea change 


Nico Stehr ponders Naomi Klein’s call for strategic mass action on climate change. 


his year, New Zealand became the 
| first country to acknowledge climate 
change when granting residence on 
humanitarian grounds — in this case, to 
a family from the low-lying Pacific island 
nation of Tuvalu. That the environmental 
and human realities of climate change are 
tangled up in such legal, economic and polit- 
ical complexities is the focus of social activist 
Naomi Klein’s This Changes Everything. 

Klein’s journey to this realization began 
in 2009 when, in the middle of the inter- 
national economic meltdown, she first rec- 
ognized the magnitude of climate change. 
As she states: “I denied climate change for 
longer than I care to admit. Five years on, 
she has synthesized her thinking about the 
dual financial and environmental disasters, 
arguing that the “market fundamentalism” 
favoured by the United States and the United 
Kingdom — involving privatization of public 
services, deregulation of corporate activities 
and reduced public spending — has “system- 
atically sabotaged our collective response 
to climate change”. Klein now advocates a 
mediating social force between climate sci- 
ence, politics and individual responsiveness 
and responsibility — in essence, “mass social 
movements” aiming to reduce fossil-fuel 
use and push for adaptation measures and 
behavioural change. 

Klein’s book — a combination of polemic, 
manifesto and analysis — covers much famil- 
iar territory. We get the history of global- 
warming discussions, the phenomenon of 
climate denialism and today’s economic 
order, including the clash of interests between 
trade regimes and climate policy. As Klein 
notes, the “liberation of world markets, a 
process powered by the liberation of unprec- 
edented amounts of fossil fuels”, is now 
helping to accelerate the melting of Arctic 
ice. She also discusses responses to the cli- 
mate emergency, including geoengineering 
schemes, as well as resistance to large-scale 


mining projects in var- 
ious parts of the world. 
She includes her own 
eye-witness reportage 
from the front line, 
such as Canada’s oil 
sands and the 2010 
Deepwater Horizon 


oil spill in the Gulf of 
This Changes Mexico. 
Everything: Klein acknowledges 
Capitalismvsthe that extraction and use 
Climate _ of fossil fuels is hardly 
eehealey analogous to social or 
Simon and Schuster/ 


political oppression 
such as slave owner- 
ship or gender discrimination, but argues 
that historical movements against such prac- 
tices show how societal pressure can build, 
pushing governments to act. Mass action 
focused on climate change could have a 
transformative effect on societies, she posits, 
empowering the poor to demand rights and 
services. 

In arguing that the “fundamentalist” 
changes to the structure of capitalism have 
stymied such a transformation, Klein over- 
states the case, however. Although privati- 
zation and deregulation triumphed in the 
1980s in the United States and the United 
Kingdom, they did not in significant parts 
of the rest of the world, as she claims. And 
whether the US economic order exemplifies 
a secular trend towards global dominance 
remains an open issue. The conditions 
for change could be more favourable than 
Klein thinks, especially when it comes to 
the removal of ideological roadblocks to 
improving the ethics of markets on the basis 
of moral rather than mere monetary motives 
of production and consumption. 

Of course, many other impediments to 
governing climate change remain. Klein is 
too optimistic in claiming that the immedi- 
ately relevant solutions, such as adaptation or 


Allen Lane: 2014. 


reducing fossil-fuel use, have broadly been 
long understood. The governance of climate 
change, not merely mitigation and adapta- 
tion, is a tortuous problem and is hardly 
well developed theoretically, but it remains 
a key area of interdisciplinary research and 
real-world policy. We are just at the begin- 
ning of this difficult intellectual and practi- 
cal journey. Klein recognizes the Sisyphean 
tasks ahead and proposes economic, legal 
and social measures that would enable bet- 
ter governance, such as the reform of trade 
law and the prevention of fossil-fuel extrac- 
tion through the recognition of indigenous 
peoples rights to oil- and coal-rich land. 

The special appeal of Klein’s position 
is her insight that any successful effort to 
curb emissions or adapt to climate change 
demands popular, pragmatic and sensible 
transformative goals that go well beyond 
mere fencing in. In contrast to climate sci- 
entists and observers such as James Hansen 
and James Lovelock, she is not an advocate 
of “inconvenient democracy” — that is, the 
claim that certain forms of democratic gov- 
ernance need to be overcome before climate 
change can be tackled effectively. 

Whether This Changes Everything has 
identified the potential catalyst that will bring 
about an alternative future remains uncer- 
tain. Still-to-be-born large social movements 
around the world could act as a countervail- 
ing force to ‘business as usual and, as Klein 
puts it, “simultaneously clear some alterna- 
tive pathways” to safer futures for human- 
kind. Klein has explored early manifestations 
of such resistance among smaller, often ad 
hoc local social groups around the world. But 
her message is still delivered with a strong 
North American accent. = 


Nico Stehr is Karl Mannheim Professor of 
Cultural Studies at Zeppelin University in 
Friedrichshafen, Germany. 

e-mail: nico.stehr@t-online.de 


The Dynamics of Disaster 

Susan W. Kieffer (W. W. Norton, 2014) 

Geologist Susan Kieffer showcases Earth’s most 
destructive processes, highlighting geographical 
discrepancies in disaster preparedness. In 2010, for 
example, similar-energy earthquakes caused over 
50,000 deaths in Haiti, yet none in New Zealand. (See 
Roger Bilham’s review: Nature 502, 438-439; 2013.) 
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What Makes a Hero?: The Surprising Science of 
Selflessness 

Elizabeth Svoboda (Current, 2014) 

Would you risk your life for a stranger's? Survival 
instinct would suggest not, but science writer 
Elizabeth Svoboda finds that heroism comes 
naturally to some, and others can learn altruism 
using methods such as compassion meditation. 


Technology and its discontents 


Jaron Lanier surveys four studies probing the vexed nexus of mind and digisphere. 


igital technology is remaking the 
D cognitive environment in which 
human brains develop and func- 
tion. This swift revolution is inevitably 
sparking much hard thinking. Books by 
neuroscientists Susan Greenfield and 
Daniel Levitin, and writers Nicholas Carr 
and Paul Roberts, propose either adapta- 
tion to the changes — self-help strategies 
to compensate for emerging cognitive mis- 
alignments — or critiques of the overall 
transformation. 
Greenfield's Mind Change takes the latter 
approach. It proposes that global climate 
change can serve as a useful metaphor for 


Quest for a New Utopia 


Smart Cities: Big Data, Civic Hackers, and the : = 


Anthony M. Townsend (W. W. Norton, 2014) 

As technology infiltrates urban life, Anthony 
Townsend observes how cities evolve in the digital 
sphere, from parking apps in Germany to crowd- 
sourced maps of African slums. (See Melanie 
Moses’ review: Nature 502, 299-300; 2013.) 


Mind Change: How Digital Technologies Are 
Leaving Their Mark on Our Brains 

SUSAN GREENFIELD 

Rider: 2014. 


The Impulse Society: What’s Wrong With 
Getting What We Want? 

PAUL ROBERTS 

Bloomsbury: 2014. 


The Organized Mind: Thinking Straight in 
the Age of Information Overload 

DANIEL J. LEVITIN 

Dutton: 2014. 


how human minds — our inner environ- 
ments — are, in her view, being recklessly 
altered by digital technologies. Greenfield 
argues that because the human brain is 
remarkably plastic in youth, it is not unrea- 
sonable to ask how recently introduced, 
ubiquitous digital designs (such as those of 


*SYCHOpay, { 
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The Glass Cage: Automation and Us 
NICHOLAS CARR 
W. W. Norton: 2014. 


social networks or reading tablets) might 
affect brain development. The acquisition 
of speech and reading can affect human 
brain architecture, but there has been 
little precedent for the kind of sudden, 
uniform, pervasive change in children’s 
cognitive environments posed by these 


The Psychopath Inside 

James Fallon (Current, 2014) 

After confusing his own brain scan with a 
psychopath’s, neuroscientist James Fallon 
trawled his past and genealogy. Assembling 
evidence from obsessive-compulsive disorder 
to violence in his family history, Fallon considers 
how nurture may overcome nature. 
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technologies. How might they affect the 
sense of identity or organic memory, for 
instance? Although she sometimes seems 
to push her argument beyond the reach 
of current research, Greenfield asks key 
questions — such as whether the next 
generation will think less critically than 
their forebears. And she broadly out- 
lines the kind of research and policy 
agenda needed to address such haunting 
unknowns. 

She occasionally veers into alarmism, 
for instance when discussing specula- 
tive links between the apparent rise in 
autism and the rise in the use of particu- 
lar digital environments. However, some 
of Greenfield’s caution may be justified. 
The neuroscience and cognitive-science 
communities that overlap with digital- 
technology developments often rely on 
the technology industry for support or 
cooperation, so it is especially impor- 
tant that they are not swayed by that 
industry’s extreme enthusiasms. For 
all its faults, Mind Change is an impor- 
tant presentation of an uncomfortable 
minority position. It should be read by 
technologists in particular, as a check on 
self-congratulation. 

By contrast, in The Organized Mind, 
Levitin takes the self-help approach. 
Accepting the design of information tech- 
nology and today’s information deluge as 
givens, he explores better brain function 
in that context. Our networked age often 
confounds the human mind, he notes, 
because of the kinds of cognitive quirks inves- 
tigated by psychologists Daniel Kahneman 
and his late colleague Amos Tversky — nota- 
bly Kahneman’s idea of two brain systems, 
one ‘quick and dirty’ and the other slower 
and more reasoned. Levitin’s strategy for over- 
coming such quirks is a set of tricks. To bypass 
poor intuitions about statistics, for instance, 
he suggests assessing data using a simple four- 
fold diagram. 

Levitin’s presentation is sensible and 
actionable, but I suspect that his audience 
is the sub-population residing between the 
extremes of technical ability. This group 
holds much of society's money and power: 
our highly technical society is for the most 
part guided by semi-technical people. 

Carr's The Glass Cage — a meditation on 


AUTOMATION IN THE AGE OF 
CLOUD COMPUTING IS OFTEN A 


FAKE FRONT. 


REAL PEOPLE, 


ANONYMIZED AND 
DEVALUED, 


ARE THE SOURCES OF 
THE ‘BIG DATA. 


automation, from apps-for-everything to 
self-driving cars — asks at the start how we 
should define a human being in such an era. 
Does automation change the sense of how 
people act, learn, or find value in their lives 
and each other? Carr tells contemporary and 
historical tales of technologists and entrepre- 
neurs dripping with hubris, such as aviation 
wizard Wilbur Wright, and of people strug- 
gling with a sense that they are becoming 
denatured by a reliance on automation. 
Carr can be understood as part ofa liter- 
ary movement that does not reject technolo- 
gies. Rather, it rejects ceding what Carr calls 
“choices about the texture of our daily lives” 
to technologists and their businesses. That 
stance is a tightrope walk: one must move 
forward, succumbing neither to Luddite 


tendencies nor to the seductions of hot 
technological trends. 

Carr is one of our most accomplished 
tightrope walkers. However, The Glass 
Cage does fall prey to a flawed conceit. 
Automation in the age of cloud comput- 
ing is often a fake front. It is real people, 
anonymized and unvalued, who are the 
sources of the ‘big data’ that allow cloud 
algorithms to function. Automatic lang- 
uage translation is made possible only 
through daily sampling of human trans- 
lators’ work. Celebrating how people are 
contributing to technology in new ways 
could address some of the problems Carr 
decries, whether economic or cognitive. 

For The Impulse Society, Roberts draws 
on the work of research psychologists such 
as Walter Mischel, who has studied delayed 
gratification. More lament than prescrip- 
tion, the book considers the many ways in 
which technologies encourage an infantile 
desire for immediate gratification. What 
is most striking about Robert's critique is 
its panoramic sweep. During the financial 
crises of the past decade, for instance, an 
urge for an instant ‘hit’ cropped up among 
individual borrowers keen on home own- 
ership, lenders set on unbelievable deals, 
and shareholders eager for soaring security 
valuations. At every level people were dis- 
abled by a common infatuation with false 
gold proffered by digital networks. 

Roberts trips a bit towards the end 
of his book: he calls for a resurgence of 
traditional community as an alternative 
to the modern trend towards impatience. 
The book’s ultimate programme seems sen- 
timental and ill-matched to the theatre in 
which the troubles arise. 

Taken together, these four books reveal a 
frontier of human experience. We are rap- 
idly changing society, and in the course of it 
potentially laying our brains open to change. 
We must now become both competent and 
wise in our powers — not simply resist- 
ing or embracing new media technologies, 
but becoming instead more self-aware and 
discerning in relation to them. m 


Jaron Lanier is a computer scientist with 
Microsoft Research. His latest book is Who 
Owns the Future? 

e-mail: jalani@microsoft.com 


The Simpsons and their Mathematical Secrets 
Simon Singh (Bloomsbury, 2014) 

US television series The Simpsons is craftily 
dotted with maths jokes by numerate writers 
who chose comedy over academia. Physicist 
Simon Singh exposes and explains gags of 
varying complexity, although all can chuckle at 
Homer’s naive belief in an “infinity plus one”. 


Shores of Knowledge: New World Discoveries 
and the Scientific Imagination 

Joyce Appleby (W. W. Norton, 2014) 

Six centuries of overseas exploration is lucidly 
charted by historian Joyce Appleby. While 
voyagers exulted over exotic species, the spread 
of disease to indigenous peoples exposed the 
high price of scientific discovery. Emily Banham 
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Ebola virus control 
needs local buy-in 


International guidelines 
describe effective measures for 
the prevention and control of 
the Ebola virus. But we need 
more practical information 

on how to implement these 
measures, including potential 
therapies and a safe vaccine, in 
non-Western settings (Nature 
513, 13-14; 2014). 

Culturally tailored procedures 
must be detailed in international 
public-health protocols, 
drawing on the expertise of 
medical anthropologists. 

Such measures will also help 
to overcome mistrust of 

authorities and field workers 
among affected populations. 

The current Ebola outbreak 
will not be controlled without the 
local population’s understanding 
and cooperation. 

Gilles Guerrier Hétel-Dieu and 
Cochin Hospitals, Paris Descartes 
University, France. 

Eric D’Ortenzio Solthis, France; 
and Bichat Hospital, Paris, France. 
guerriergilles@gmail.com 


Climate models: 
sunk by humans? 


Paul Palmer and Matthew 
Smith’s argument that human 
adaptation to climate change 
should be incorporated into 
climate-projection models is 
entirely reasonable (Nature 
512, 365-366; 2014). However, 
I suspect that doing so could 
render such models essentially 
useless. 

Climate models are created 
with the intention of providing 
predictions that are more 
reliable, and as such must always 
wrestle with the bias—variance 
dilemma. Introducing human 
responses to climate change 
will make this issue much more 
challenging than it already is — 
and perhaps hopelessly so. 

To put it bluntly, one does 
not need to be an expert in 
modelling non-linear systems to 
recognize that the best answer 


to the question “How can we get 
more precise predictions?’ is 
never ‘Add lots more variables: 
Robert A. J. Matthews Didcot, 
UK. 

rajm@physics.org 


Climate models: use 
archaeology record 


Archaeologists and historians 
have long investigated societal 
responses to climate change 
(see P. Palmer and M. Smith 
Nature 512, 365-366; 2014). 
These records are an underused 
resource in current climate- 
adaptation research, but offer 
scope for highly integrative 
meta-analyses that would be 
useful to climate scientists, 
science advisers and policy- 
makers, and could provide 
important information for local 
outreach efforts. 
Risk-reduction researchers 
have pointed out that responses 
to climate change are a mix 
of contemporary industrial 
(technological) measures 
and pre-industrial (social 
and community-based) 
ones. However, the use of 
palaeoenvironmental data by 
the Intergovernmental Panel 
on Climate Change as a basis 
for drawing up future climate- 
change scenarios is not matched 
by an equally sophisticated 
use of ‘palaeosocietal’ data for 
investigating human impacts 
and adaptive pathways. 
Archaeological and historical 
data could provide a solid 
evidence base for effective 
adaptations to climate change. 
Expanding the chronological 
scope of climate-adaptation 
research into deep time would 
vastly enlarge the database of 
available case studies without 
getting into the tricky issues of 
data access and legal sensitivity. 
In effect, this approach draws 
on natural experiments in 
history to learn from the past 
(see R. Van der Noort Climate 
Change Archaeology Oxford 
Univ. Press; 2013). 
Felix Riede Aarhus University, 


Hojbjerg, Denmark. 
friede@cas.au.dk 


Monitor Brazil’s fish 
sampling closely 


Brazil’s aquaculture and fisheries 
secretary decreed last month 
that 2,000 different species of 
ornamental fish can be legally 
removed from the Brazilian 
Amazon. The fish will be farmed 
to supply the aquarium trade. 
This raises concerns about 
over-exploitation and threats 

to biodiversity, particularly 
given the poor record of 
inspection and reinforcement 
by the country’s environmental 
agencies (see A. L. B. Magalhaes 
and J. R. S. Vitule Science 341, 
457; 2013). 

The new ruling could stimulate 
indiscriminate extraction, 
biopiracy, fish trafficking and 
the escape of farmed species 
into ecoregions of the country 
where they are not native. Close 
monitoring must be a priority. 

We should be educating people 
about how to conserve Brazil’s 
exuberant aquatic diversity, not 
encouraging its plunder. 

Jean R. S. Vitule Federal University 
of Parana, Curitiba, Brazil. 

Flavia D. E Sampaio Federal 
Institute of Paranda, Curitiba, Brazil. 
André L. B. Magalhaes Pontifical 
Catholic University of Minas 
Gerais, Belo Horizonte, Brazil. 
andrebiomagalhaes@gmail.com 


Shale gas is a fraught 
solution to emissions 


Qiang Wang suggests that 
shale gas might be used as a 
bridging fuel to cap China's 
carbon emissions (Nature 512, 
115; 2014). Extraction and 
development problems could 
make this difficult. 

The greenhouse-gas footprint 
of shale gas is much bigger than 
that of coal. Shale gas emits less 
carbon dioxide than coal or oil 
when burnt, but the methane 
produced during the extraction 
process has a global-warming 


potential 70 times that of CO, 
(see R. W. Howarth et al. Nature 
477, 271-275; 2011). 

Following the US shale-gas 
boom, China devised a plan to 
extract its own gas resources. 
This proved difficult and 
expensive owing to limited water 
availability and because the gas 
is located at depth under large 
amounts of subsurface clay. 

Furthermore, extraction 
might compromise the country’s 
already stressed aquatic 
environments (H. Yang et al. 
Nature 499, 154; 2013) and 
increase seismic activity — 
important factors in densely 
populated areas such as 
southwest China. 

Asa result, Chinese shale-gas 
production in 2013 was only 
around 3% of that planned for 
2015. Last month, this forced the 
country to halve its production 
target for 2020 (see go.nature. 
com/h5miza; in Chinese). 

In our view, China would 
be better offinvesting more in 
renewable energy and improving 
energy efficiency. 

Hong Yang University of Oslo, 
Norway. 

Julian R. Thompson University 
College London, UK. 
hongyanghy@gmail.com 


Aristotle’s suspect 
statistical skills 


In his review of Armand 

Marie Leroi’s book The Lagoon 
(Nature 512, 250-251; 2014), 
Roberto Lo Presti rightly praises 
Aristotle’s observational skills. 
But the philosopher may not 
have been as adroit in numeracy 
as he was in biology. 

Aristotle famously declared 
that “males have more teeth 
than females in the case of men, 
sheep, goats, and swine” (see 
his History of Animals, Book 2, 
Part 3). This was an obvious 
sampling mistake, which bears 
out the importance today ofa 
strong statistical foundation for 
biological curricula. 

Taner Z. Sen Ames, Iowa, USA. 
tanerzsen@gmail. com 
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GATM gene variants and statin myopathy risk 


ARISING FROM L. M. Mangravite et a/. Nature 502, 377-380 (2013); doi:10.1038/nature12508 


Mangravite et al.‘ identified six expression quantitative loci (eQTLs) 
that interacted with simvastatin exposure by using 480 lymphoblastoid 
cell lines exposed to B-hydroxy simvastatin acid in vitro. One of these 
SNPs (189806699) within the glycine amidinotransferase (GATM) gene 
was shown to have an association with statin-induced myopathy in two 
independent cohorts (n = 172 myopathy cases), conferring a protective 
effect (odds ratio = 0.61, 95% confidence interval = 0.39-0.95, P = 0.03). 
Our genotyping results from statin myopathy patients do not appear to 
replicate this finding. There is a Reply to this Brief Communication Aris- 
ing by Mangravite, L. M. et al. Nature 513, http://dx.doi.org/10.1038/ 
nature13630 (2014). 

Using primarily the UK Clinical Practice Research Datalink, an elec- 
tronic healthcare record database, we recruited 145 cases with statin- 
induced myopathy and 537 statin-exposed control patients”. In addition, 
five patients meeting our case inclusion criteria were identified prospec- 
tively through a tertiary adult muscle clinic. Our myopathy phenotype 
was defined as serum creatine kinase levels of greater than 4 X upper 
limit of normal (ULN) or clinical record of rhabdomyolysis concurrent 
with statin prescription. In a proof-of-concept study, using a subset of 
patients (78 cases, 372 controls)’ we were able to show an association 
between the SLCO1B1*5 allele (rs4149056) and both statin-induced myop- 
athy (odds ratio = 2.1, 95% confidence interval = 1.3-3.1) and severe 
myopathy (n = 23, odds ratio = 4.1, 95% confidence interval = 2.1-8.2), 
consistent with the genome-wide association study (GWAS) findings 
from the SEARCH collaborative’. 

We have undertaken genotyping for the rs9806699 GATM single- 
nucleotide polymorphism (SNP) in our cases and drug-exposed control 
patients (n = 150 and 587, respectively, after quality control) in order to 
attempt replication of the association shown by Mangravite et al... How- 
ever, we were unable to show a significant difference in the minor allele 
frequency of rs9806699 between myopathy cases (MAF = 0.28) and con- 
trols (MAF = 0.30) (odds ratio = 0.94, P = 0.68). The MAF in our cases 
was similar to that identified in controls in the paper by Mangravite et al.’. 
By limiting cases to just those with ‘severe’ myopathy (creatine kinase 
> 10 X ULN or rhabdomyolysis) (n = 37), we again failed to show a 
significant difference in MAF between cases and controls (odds ratio 
= 0.94, P = 0.83). Further analysis restricted to patients only receiving 
simvastatin (99 cases, 344 controls) also did not demonstrate an associ- 
ation between rs9806699 and risk of either myopathy (odds ratio = 1.12, 
P= 0.49) or severe myopathy (n = 26, odds ratio = 1.42, P = 0.24). 


Analysis restricted to the 120 cases that were not on drugs known to 
interact with statins also did not change the result. We have also under- 
taken genome wide analysis of 128 myopathy cases (Illumina Human 
OmniExpress Exome 8v1), and comparison with the WT'CCC2 (Well- 
come Trust Case Control Consortium 2) genotype data also did not show 
any association between statin myopathy (generalized or severe) and any 
of 90 typed or imputed SNPs within the GATM gene locus. 

In conclusion, we have not been able to replicate the association 
between the rs9806699 GATM SNP and statin myopathy reported by 
Mangravite et al.’ in an independent sample set despite the fact that all 
patients were of European ancestry and had similar statin-myopathy 
phenotypes. This association will need to be assessed in more patients, 
and through an individual patient-data meta-analysis to determine, first, 
whether the SNP is relevant to a sub-phenotype of statin myopathy, and 
second, its clinical and mechanistic relevance. 
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GATM locus does not replicate in 


rhabdomyolysis study 


ARISING FROM L. M. Mangravite et a/. Nature 502, 377-380 (2013); doi:10.1038/nature12508 


All HMG-CoA reductase inhibitors (statins) can cause muscle injury 
ranging from asymptomatic elevations in creatine kinase levels to severe 
muscle breakdown (rhabdomyolysis) leading to kidney failure and death’, 
and the genetic variants responsible for this uncommon adverse drug 
reaction remain largely undiscovered. Mangravite et al. reported a new 
locus in the gene GATM (rs9806699) that was associated with a decreased. 
risk of muscle injury in two case-control studies of myopathy (odds ratio, 
0.60)*. In a larger case-control study of statin-related rhabdomyolysis, 


a more severe form of muscle injury, we were unable to replicate this 
finding. This failure to replicate raises questions about the role of 
GATM in statin-related muscle injury. There is a Reply to this Brief 
Communication Arising by Mangravite, L. M. et al. Nature 513, http:// 
dx.doi.org/10.1038/nature13630 (2014). 

Mangravite et al. used differential gene expression profiling of lym- 
phoblastoid cell lines exposed to simvastatin to identify cis-expression 
quantitative trait loci (eQTLs) for the gene GATM as candidate loci for 
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Table 1| Association of GATM loci with the risk of cerivastatin-related rhabdomyolysis 


All subjects 


Excluding fibrate users 


SNP Cases (N= 175), MAF Controls (n=645), MAF OR 95% Cl Pvalue Cases (n= 76), MAF Controls (n = 643), MAF OR 95% Cl P value 
rs9806699 0.27 0.28 1.01 0.70-1.45 0.96 0.24 0.28 0.84 0.52-1.36 049 
rs1719247 0.29 0.25 1,37 0.98-1.90 0.07 0.24 0.25 1.00 0.64-1.57 0.99 
rs 1346268 0.29 0.27 1.25 0.90-1.73 0.18 0.24 0.27 0.88 0.52-1.36 0.57 


Rhabdomyolysis case subjects had creatine kinase levels >10 x the upper limit of normal and used cerivastatin at the time of onset of symptoms of muscle pain or weakness. Control subjects did not experience 
rhabdomyolysis and used the following statins: lovastatin (44%), simvastatin (19%), pravastatin (18%), atorvastatin (13%), fluvastatin (6%) or cerivastatin (1%). Cl, confidence interval; MAF, minor allele 


frequency; OR, odds ratio; SNP, single nucleotide polymorphism. 


pharmacogenomic associations with muscle injury, which they evaluated 
in two case-control studies of myopathy. Variation at their most signifi- 
cant cis-eQTL for GATM, rs9806699, was associated with a decreased risk 
of muscle injury (odds ratio = 0.60, 95% confidence interval = 0.39-0.95) 
in a study with 72 mild myopathy cases (blood creatine kinase levels > 3 X 
the upper limit of normal (ULN) with muscle symptoms) recruited 
from a healthcare organization (Marshfield). In a second study with 39 
mild and 61 severe myopathy cases (creatine kinase > 10 X ULN with 
muscle symptoms) using simvastatin during the SEARCH clinical trial, 
variation at two single-nucleotide polymorphisms (SNPs) in linkage dis- 
equilibrium with rs9806699 (1° = 0.7) was also associated with a decreased 
risk of muscle injury (rs1719247, odds ratio = 0.61, 95% confidence 
interval = 0.42-0.88; rs1346268, odds ratio = 0.62, 95% confidence 
interval = 0.43-0.90). On the basis of these epidemiologic findings and 
the results of functional studies in hepatocyte-derived cell lines, the authors 
identified GATM as a new genetic locus for statin-induced myopathy. 

We attempted to replicate these findings in a case-control study of 
rhabdomyolysis (creatine kinase > 10 X ULN and muscle symptoms) 
related to the use of cerivastatin**, which was removed from the market 
in 2001 because ofa high incidence of this adverse drug reaction. Rhab- 
domyolysis cases (175; 94.9% with European ancestry) were compared 
with statin-using control subjects from the Cardiovascular Health Study 
without rhabdomyolysis (645; 99.7% with European ancestry). Varia- 
tion at rs9806699 was not associated with the risk of rhabdomyolysis 
(odds ratio = 1.01, 95% confidence interval = 0.70-1.45), and variation 
at the other two SNPs was weakly associated with an increased risk 
(rs1719247, odds ratio = 1.37, 95% confidence interval = 0.98-1.90; 
181346268, odds ratio = 1.25, 95% confidence interval = 0.90-1.73). 
Ninety-nine rhabdomyolysis cases used fibrates, which can cause drug— 
drug interactions with statins, and excluding fibrate users also resulted 
in null associations (Table 1). Combining our results (all subjects) with 
the results of Carr et al.° and Mangravite et al.” in a fixed-effects meta- 
analysis resulted in null associations at rs9806699 (odds ratio = 0.88, 95% 
confidence interval 0.72-1.08, P = 0.22), rs1719247 (odds ratio = 0.86, 
95% confidence interval = 0.69-1.07, P = 0.17) and rs1346268 (odds 
ratio = 0.85, 95% confidence interval = 0.68-1.05, P= 0.12). There 
was statistical heterogeneity at rs1719247 (t* = 0.22, P= 0.001) and 
181346268 (17 = 0.14, P = 0.009). 

Although most cases from the SEARCH trial involved severe myo- 
pathy, it is possible that the GATM variants identified by Mangravite 
et al. protect against mild but not severe statin-related muscle injury. 
Other differences in the study populations could also result in hetero- 
geneity of the effects of these variants. An alternative explanation for the 
discrepant findings is that GATM is not related to this adverse drug reac- 
tion. By contrast, a non-synonymous variant in the drug transporter gene 
SLCOIB1 (rs4149056) that decreases the clearance of statins”* has been 
associated with statin-related muscle injury of various severity and statin 
types” ”. The odds ratio for the rs4149056 minor allele in our rhabdo- 
myolysis study (2.0) (ref. 3) was similar to the odds ratios in a study of 
less-severe myopathy cases related to simvastatin use (2.1) and ina recent 
meta-analysis (2.2) (ref. 11). In other words, the drug transporter encoded 
by SLCO1B1 is a widely replicated finding’. 
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The approach by Mangravite et al.” of identifying potential new phar- 
macogenomic interactions through differential gene expression profil- 
ing is innovative. However, the failure to replicate their findings in a 
large study of rhabdomyolysis raises questions about whether GATM 
represents a genuine genetic locus for this adverse drug reaction. 


Methods 


Case subjects were recruited through attorneys representing cerivastatin users who 
developed rhabdomyolysis. Trained abstractors reviewed medical records to vali- 
date rhabdomyolysis events. As cerivastatin comprised a small fraction of statin use 
during its market life (March 1998 to August 2001), it was not practicable to assem- 
ble a broad sample of cerivastatin users who did not develop rhabdomyolysis. Instead, 
the control group comprised statin-using participants of the Cardiovascular Health 
Study, a prospective cohort study of older adults'*"*. This work was funded by a 
grant from the NHLBI, HL078888. 
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REPLYING TO D. F. Carr et a/. Nature 513, http://dx.doi.org/10.1038/nature13628 (2014); J. S. Floyd et a/. Nature 513, http://dx.doi.org/10.1038/ 


naturel13629 (2014) 


Our study’ tested for associations of single-nucleotide polymorphisms 
(SNPs) at the GATM loci with statin-induced myopathy based on the 
finding that one of these SNPs (rs986699) was associated with statin- 
induced expression of GATM in a panel of human lymphoblastoid cell 
lines, and the fact that GATM encodes the enzyme responsible for syn- 
thesis of creatine, a major source of energy in skeletal muscle’. Signifi- 
cant associations with incidence of myopathy were found for rs9806699 
in statin users from the Marshfield Clinic cohort. Furthermore, signifi- 
cant association was reported in both the Marshfield cohort and in the 
SEARCH clinical trial of simvastatin treatment for two SNPs in linkage 
disequilibrium with the index SNP (rs1719247 and rs1346268, r>0.7) 
that were genotyped in each of these groups. We have extended our meta- 
analysis to include the study data reported in the accompanying Com- 
ments by Carr et al.’ and Floyd et al.’, two studies that individually failed 
to replicate this association. 

The original analysis was performed on data from patients who were 
not on fibrates in the Marshfield and SEARCH populations. This was 
done to mitigate the risk that a possible modest protective effect of the 
SNPs would be masked by the known pharmacokinetic confounding 
caused by concomitant use of fibrates or other drugs that promote myop- 
athy by altering statin pharmacokinetics*. We have done the same for 
the results of Floyd et al. in the meta-analyses presented below, although 
it is notable that, based on clinical presentation and creatine kinase 
levels, the majority of the myopathy cases of Floyd et al. were consid- 
erably more severe than in the originally reported cohorts*. This analyt- 
ical approach was not possible for the study of Carr et al., since data for 
this subgroup were not provided. In this regard we note that because 
pharmacokinetic effects are major determinants of statin toxicity, the 
confirmation by both Carr et al. and Floyd et al. of an association of 
myopathy with a functional variant of the transporter gene SLCO1B1 is 
not representative of the power of their analyses to detect a SNP asso- 
ciation with a modest pharmacodynamic effect. 

A fixed-effects meta-analysis yielded the following P values: 189806699 
(Marshfield, Carr et al. and Floyd et al.), P = 0.085; rs1719247 (Marsh- 
field, SEARCH and Floyd et al.), P = 0.0042; rs1346268 (Marshfield, 
SEARCH and Floyd et al.), P = 0.0035. Thus, the statistical significance 


of the initially reported association is weakened but not eliminated by 
the inclusion of the additional cohorts. Future efforts to replicate these 
findings should give consideration to heterogeneity of patient charac- 
teristics, matching of statin exposure in cases and controls, avoidance of 
concomitant drug use and other confounding factors, and the statistical 
power to detect an association of modest effect size. We agree with Carr 
et al. that the association should be assessed in more patients and hope 
that a larger meta-analysis will be performed. In addition, further studies 
will be required to determine a mechanistic basis for a contribution 
of GATM genetic variation to the risk of statin-related myopathy. This 
Reply is written by the subset of authors that designed and led these 
analyses. 
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NEWS & VIEWS 


EVOLUTIONARY BIOLOGY 


Radiating genomes 


Genome sequences and gene-expression data from representatives of five distinct lineages of African cichlid fish reveal 
signatures of the genomic changes that underlie the astounding cichlid diversity seen today. SEE ARTICLE P.375 


CHRIS D. JIGGINS 


here are more than 2,000 species of 
cichlid fish, the majority of which are 
found in three large African lakes. This 
species radiation is the result of somewhere 
between 20 million and 45 million years of 
evolution, although, remarkably, around 
500 species found in one of these lakes, Lake 
Victoria, arose in only the past 100,000 years”. 
The amazing diversity of these fishes, and the 
speed with which they have evolved, makes 
them truly one of the natural wonders of the 
world. On page 375 of this issue, Brawand 
et al.” report five genome sequences, one from 
each of the major lineages of African cichlid. 
The data offer insight into cichlid diversifi- 
cation and provide a rich resource for future 
genomic analyses. 

The species diversity of African cichlids is 
matched by their diversity in both ecology 
and morphology (Fig. 1). They occupy a huge 
range of ecological niches — ranging from 
some fairly standard fishy activities, such as 
eating algae or molluscs’, through to the more 
bizarre, such as the scale eaters, which use their 
asymmetric jaws to nibble scales from the sides 
of other fishes*. Many of these forms have 
evolved, apparently independently, in each of 
the different lineages, which have undergone 
varying degrees of species radiation. The new 
reference genomes offer exciting opportunities 
to identify the genetic changes that led to this 
extraordinary diversity of morphological and 
ecological traits. 

As well as being a resource for future stud- 
ies, the sequences hold intriguing clues to the 
genomic changes underlying the radiation. A 
well-established route for evolutionary inno- 
vation is through gene duplication, which can 
permit existing genes to diversify and take 
on new functions’. The genomes reported by 
Brawand and colleagues provide evidence for 
a burst of gene duplications associated with 
species radiation. This implies that natural 
selection has favoured the retention of dupli- 
cate genes in African cichlids, perhaps in part 
owing to their role in adapting to new envi- 
ronments. This hypothesis is also supported 
by the authors’ gene-expression data, which 
show that many of the retained duplicate genes 
exhibit new expression patterns. Notably, 20% 


Lake Malawi 


Lake Tanganyika 


Lake Victoria 


Pelagic zooplanktivore 


* 


Rock-dwelling algae scraper 


Scale eater 


Reef-dwelling planktivore 


Lobe-lipped insect eater 


River dwellers 


Snail crusher 


Figure 1 | Niche diversity. The differing feeding habits of cichlid fish in the three African lakes with the 
highest cichlid diversity — Lake Malawi, Lake Tanganyika and Lake Victoria — provide a sample of the 
diversity of ecological niches they occupy. Brawand et al.’ present the genome sequences of one species 
from each of these lakes and two river-dwelling cichlids (indicated by red stars). (For photo credits, see 


Figure 1 of the paper’.) 


of duplicate pairs have gained a completely 
new tissue-specific expression domain, con- 
sistent with gene duplication having led to a 
new gene function. 

In addition to gene duplication, there is 
genomic evidence for accelerated evolution of 
protein-coding genes in the cichlids as com- 
pared with stickleback fish, which have not 
undergone a similarly rapid radiation and so 
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provide a useful control group for this analysis. 
The accelerated evolution in cichlids was 
particularly striking in opsin genes, which 
encode proteins involved in colour vision, 
and in genes encoding members of the BMP 
signalling pathway, which influence a wide 
variety of developmental processes, including 
jaw development. 

Of course, divergence in gene function can 


also occur through changes in gene regulation, 
without a change in the protein-coding 
sequence, and Brawand et al. also addressed 
this. The cichlid genomes show evidence for 
enhanced rates of evolution in putative regula- 
tory elements, and high evolutionary turnover 
in microRNAs — a class of RNA molecule that 
regulates gene expression. Furthermore, the 
genomes reveal 40 new microRNA-encoding 
genes that, intriguingly, show complementary 
patterns of expression relative to the genes they 
are hypothesized to regulate. This suggests that 
they are involved in suppressing gene expres- 
sion, perhaps to stabilize and refine expres- 
sion patterns that have been acquired during 
the radiation. 

The authors attribute the great diversity 
of changes seen across these genomes to a 
period of relaxed selection that occurred early 
in the radiation. During this time, the selec- 
tive pressures that maintained the stability of 
the genome were reduced, thereby allowing 
genetic variation to accumulate and produce 
subsequent diversification into the lineages we 
observe today. However, accelerated evolution 
can result either from neutral evolution due to 
relaxed selection, or from positive natural selec- 
tion acting through new selective pressures. 
Most of the genomic signatures in the paper do 
not strongly distinguish between these two pos- 
sibilities. Indeed, it seems most likely that the 
retention of gene duplicates and rapid genetic 
divergence were primarily driven by positive 
natural selection, as species adapted to the great 
diversity of ecological niches available in the 
lakes. Subsequent extinction of early lineages 
could have led to an apparent burst of rapid 
change on the branch leading to the extant 
species. There may be no need to invoke a 
genetic revolution when plain old natural selec- 
tion can explain the observed patterns. 

Although the five genomes offer some 
impressive insights into cichlid biology, I 
believe that the most exciting advances will 
come from analysis of more-closely related 
genomes within each radiation. The cichlids 
offer in abundance two of the characteristics 
that have facilitated analysis of adaptive traits 
in other taxa: there are many closely related 
species that show highly divergent morphol- 
ogy, and there is repeated evolution of similar 
traits in parallel. Whole-genome sequencing of 
multiple individual fishes with both divergent 
and convergent ecological traits will provide 
rich pickings for understanding how genetic 
changes are associated with specific ecological 
characteristics. Brawand et al. have scratched 
the surface of this task by reanalysing sequence 
data from samples of six species found in Lake 
Victoria’; these suggest that even very closely 
related species show quite high levels of diver- 
gence across the genome. 

These genomes will facilitate further studies 
that will undoubtedly enhance our under- 
standing of cichlid biology. It may be rash, but 
I will make one prediction. Work in organisms 


ranging from sticklebacks’ to butterflies* has 
shown that recent adaptive events can make 
use of ancient genetic variants. This may be 
surprising, but can occur because gene flow 
within a species, or sometimes even between 
species®, can provide ‘pre-adapted’ variants 
that permit populations to adapt rapidly to 
new challenges. So I predict that similarities 
between cichlids in different lakes that are 
currently considered to have evolved inde- 
pendently will in fact turn out to have resulted 
in part from ancient shared variation that may 
have arisen early in the radiation”. m 
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Catching relativistic 


electrons 


Low-energy electrons have been found to mimic relativistic high-energy particles 
in cadmium arsenide. This defines the first stable ‘3D Dirac semimetal’, which 
holds promise for fundamental-physics exploration and practical applications. 


ZHIHUAI ZHU & JENNIFER E. HOFFMAN 


object’s energy varies as the square of its 

velocity or momentum (Fig. la) — a rule 
that car drivers should treat with respect. Pho- 
tons, neutrinos and other light, fast-moving 
particles are governed instead by Einstein’s 
theory of relativity: their energy scales lin- 
early with their momentum, with fixed velocity 
equal to the slope of the increase. Such rela- 
tivistic high-energy particles hold the key to 
fundamental understanding of our Universe. 
But where do electrons — which determine 
the more practical properties of the materi- 
als immediately around us — fit into this 
picture? Electrons move very fast, but their 
motion is not primarily relativistic in conven- 
tional solids. However, in a paper published 
in Physical Review Letters, Borisenko et al.’ 
report the discovery of relativistic motion of 
low-energy electrons in cadmium arsenide 
(Cd,As,). Taken together with similar find- 
ings described in three independent papers, 
by Neupane et al’, Liu et al.’ and Jeon et al.*, 
this result paves the way for future relativistic 
electronics. 

The realization that low-energy electrons 
can mimic high-energy relativistic particles 
occurred a decade ago with the isolation of 
two-dimensional (2D) carbon in the form of 
graphene’. This material has dual significance 
for the exploration of fundamental phys- 
ics and for revolutionary applications; it has 


I: classical Newtonian mechanics, an 


prompted more than 100,000 publications, 
some 7,000 patent applications and a 2010 
Nobel prize. Electrons in graphene are 
described as massless Dirac fermions because 
they have half-integer spin, which makes them 
fermions, and their linear energy-momentum 
relationship obeys Dirac’s famous wave equa- 
tion, which first united quantum mechanics 
and special relativity almost a century ago. 
Graphene is also a semimetal, meaning that 
its Fermi energy (the dividing line between 
filled and empty electronic states) sits ideally 
at its ‘Dirac point’ — where its valence and con- 
duction energy bands meet (Fig. 1b) — and 
may be easily tuned using an applied voltage. 
The resultant charge carriers may be either 
electrons or holes (the absence of electrons) 
and have high mobility: a measure of inverse 
electrical resistivity per carrier, which increases 
with carrier velocity but decreases with carrier 
scattering. 

Graphene’s moderately high carrier 
velocity of about 10° metres per second, 
combined with the reduced intrinsic scatter- 
ing possibilities caused by the small carrier 
density inherent to a Dirac semimetal, can 
give a mobility up to 140 times that of silicon 
— the material of choice for most electronic 
applications. Therefore, graphene offers 
promise for making novel, high-efficiency 
electronic devices. However, graphene is 
challenging to fabricate and manipulate in 
large sheets, and its mobility is extremely 
susceptible to scattering from environmental 
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Figure 1 | Energy-momentum spectra of electrons. 


a, Classical objects and electrons exhibit a 


parabolic relationship between their energy (E) and momentum (p). b, Two-dimensional (2D) Dirac 
fermions, such as electrons in graphene, have valence (blue) and conduction (purple) energy bands with a 
linear energy-momentum relationship. These touch at a point called the Dirac point in the 3D parameter 
space formed by E, p, and Py. Shown here is a 2D slice of the 3D space. c, A 3D slice of the 4D (E, p,, Py Pz) 
energy-momentum relationship of 3D Dirac fermions such as those discovered’ in Cd, As,, with two 


Dirac points along a special high-symmetry axis (p.). 


defects because graphene is all surface. 

A second kind of 2D Dirac semimetal arises 
from another relativistic effect of electrons 
called spin-orbit coupling — the interaction 
between an electron’s spin and the induced 
magnetic field from the electron’s orbital 
motion. Spin-orbit coupling is generally small 
for materials that are made up of light atoms 
such as carbon, but for materials containing 
heavy atoms such as bismuth and cadmium, 
the interaction can be significant; for example, 
it can invert the valence and conduction bands 
in the bulk of an insulator. This inversion can 
lead to surface Dirac fermions that are topo- 
logically protected — surface carriers that are 
robust against some local disorder and have 
their spin locked to their momentum (that is, 
the carrier's momentum determines its spin). 

These ‘topological insulators’®’ provoked 
tremendous excitement in recent years about 
possible applications such as low-energy-con- 
sumption spintronic devices, which manipu- 
late the spin rather than the charge of electrons, 
for high-performance computing. But despite 
their name, existing topological insulators 
have excess conducting bulk electrons, which 
overwhelm the surface Dirac fermions and foil 
their use. 

Meanwhile, new ideas were brewing, 
suggesting that 3D Dirac semimetallic states 
could exist in the bulk of a solid material. It 
was known that such states could occur under 
finely tuned conditions, such as the exact con- 
centration of bismuth at which spin-orbit cou- 
pling becomes strong enough to invert the bulk 
energy bands in antimony-bismuth alloys® 
(Sb,_,Bi,). But more recent theoretical work 
predicted the robust occurrence of such states 
in pure materials that have certain crystalline 
symmetries: first, unstable BiO, (ref. 9), then 
air-sensitive Na;Bi (ref. 10) and, finally, the sta- 
ble compound Cd,As, (ref. 11). Furthermore, 
when time-reversal or spatial-inversion sym- 
metries are broken — for example, by appli- 
cation of a magnetic field or pressure — each 
Dirac point can split into two copies at which 
the electrons become Weyl fermions’, which 
have opposite chirality (spin orientation with 


respect to their direction of motion). These 
Wey] fermions could enable robust spintron- 
ics in three dimensions. 

Cd,As, has been known for more than 
50 years’ for its extraordinary carrier mobil- 
ity, which is larger than that of suspended 
graphene and among the highest of any bulk 
semiconductor. Thanks to the recent studies by 
Borisenko et al.', Neupane et al? and Liuet al.* 
— who all conducted experiments on Cd,As, 
using a technique called angle-resolved photo- 
emission spectroscopy (ARPES) — we now 
understand that the high mobility arises from 
high-velocity 3D Dirac semimetiallic states. 

During ARPES experiments, mono- 
chromatic light is incident on a sample and 
electrons can absorb a photon and escape from 
the material. To unveil the full 3D energy- 
momentum relationship of electrons within 
Cd,As,, a challenging but crucial step was to 
precisely measure the energy and momentum 
of emitted electrons while tuning the pho- 
ton energy through a wide range. The data’ 
clearly show a linear energy-momentum 
relationship, with two Dirac points along a 
crystal axis of four-fold rotational symmetry 
(Fig. 1c). This result proves that electrons in 
this material are 3D massless Dirac fermions 
as predicted’. Measurement of the energy- 
momentum slope gives electron velocity as 
high as about 10° ms” (ref. 2), but with a ten- 
fold discrepancy between the three studies), 
which could be due to differences in sample 
quality or the angle of the exposed surface. 
Liu et al. additionally demonstrated that the 

carrier concentration in Cd,As, could be 

finely tuned by ‘doping’ the surface of the 
material with potassium atoms’, making it a 
flexible platform for future studies. 

Most recently, Jeon et al.* used a scanning 
tunnelling microscope to confirm Cd,As, as 
a 3D Dirac semimetal down to atomic length 
scales, and to visualize how dopant atoms 
scatter carriers primarily in the valence band, 
preserving the mobility of carriers in the 
high-velocity conduction band. Furthermore, 
Jeon and colleagues applied a magnetic field, 
which is not possible in an ARPES experiment. 
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Although the field breaks the time-reversal 
symmetry that would be necessary to split the 
Dirac fermions into the more exotic chiral Weyl 
fermions, its orientation in this experiment also 
breaks the four-fold rotational symmetry of the 
crystal that was necessary to realize the Dirac 
fermions in the first place. This means that the 
first glimpse of Weyl fermions will need to wait 
for a follow-up experiment in which the mag- 
netic field has a different orientation. 

The work on Cd,As, (refs 1-4), together 
with the lower-mobility Na,Bi reported earlier 
this year", confirms the existence of motion of 
Dirac fermions inside 3D materials. Despite its 
exciting new physics, the application potential 
of Cd, As, is limited by its small band-inversion 
energy — the relativistic nature is not robust 
at room temperature’. Furthermore, Cd,As, is 
not exactly something you want in your drink- 
ing water. Nevertheless, given the new under- 
standing that robust Dirac fermions can arise 
in solids from general crystalline symmetries 
and strong spin-orbit coupling, there are prob- 
ably numerous 3D Dirac semimetals yet to be 
discovered’. Immediate research priorities 
include magnetic-field and pressure control to 
isolate chiral Weyl fermions in existing materi- 
als, realization of these materials as thin films 
to access a phenomenon known as the quan- 
tum spin Hall effect to visualize the spatial flow 
of surface Dirac fermions’’, and computa- 
tional modelling to predict new materials and 
heterostructures with larger band-inversion 
energies’*. Then exotic applications, such as 
a chiral battery’ or a ‘quantum amplifier’ of 
magnetic field, may be on the horizon”. m 
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ANIMAL BEHAVIOUR 


The evolutionary roots 
of lethal conflict 


A comprehensive analysis of lethal coalitionary aggression in chimpanzees 
convincingly demonstrates that such aggression is an adaptive behaviour, not 
one that has emerged in response to human impacts. SEE LETTER P.414 


JOAN B. SILK 


conflicts around the world’. Many of these 

had persisted for decades, killed thousands 
of people and thwarted international peace- 
keeping efforts. War is certainly a contempo- 
rary fixture, but has it always been one? There 
is vigorous disagreement over the answer to 
this question. Some argue that warfare has 
been a pervasive feature throughout human 
history and has had important effects on 
human nature’, whereas others contend that 
war is rare in foraging groups’, the kinds of 
societies that we lived in for most of our evo- 
lutionary history. Debates about the origins 
and prevalence of human warfare are echoed 
in the question of whether lethal coalition- 
ary aggression in chimpanzees has evolved 
through natural selection or whether it is a 
non-adaptive consequence of human distur- 
bance. In this issue, Wilson et al.* (page 414) 
argue persuasively on the side of adaptation. 

Many species of non-human primates have 
hostile relationships with members of neigh- 
bouring groups, and some species collectively 
defend the boundaries of their territories. 
But intergroup encounters rarely lead to seri- 
ous injuries or deaths, perhaps because the 
risks of escalated aggression usually do not 
outweigh the benefits of killing opponents. 
Lethal coalitionary attacks on individuals from 
neighbouring communities have been docu- 
mented only in chimpanzees. The first report 
of such killings was published 35 years ago’, 
but the debate about their adaptive significance 
continues. 

One point of view is that natural selection 
has favoured the evolution of lethal coalition- 
ary intergroup aggression in chimpanzees as a 
means to enhance access to valuable resources, 
such as food and mates. Intergroup aggression 
might be more deadly in chimpanzees than 
in most other species because chimpanzees 
can exploit the imbalances of power that arise 
from ‘fission-fusion’ social organization’. 
Chimpanzees often fragment into tempo- 
rary parties that travel and forage indepen- 
dently within their community’s home range. 
When parties of males encounter single 
individuals from other communities, they 
sometimes launch brutal assaults that leave 


I: 2013, there were 33 armed state-level 


victims gravely wounded or dead (Fig. 1). 

The opposing view is that lethal aggression 
is a non-adaptive response to anthropogenic 
influences, particularly artificial provision- 
ing. In early primate field studies, researchers 
often used food to lure animals out of the for- 
est, to facilitate close-range observation and to 
enhance habituation. At the Gombe National 
Park in Tanzania, where in 1962 primatolo- 
gist Jane Goodall began providing bananas to 
chimpanzees who visited her camp, this prac- 
tice had the desired effect. However, the chim- 
panzees began spending more and more time 
in the camp and rates of aggression among 
them increased. Provisioning was curtailed, 
and eventually terminated altogether. There 
were no reports of lethal aggression at Gombe 
before provisioning began, leading some 
researchers to conclude that these killings were 
the consequence of human intervention’. Sub- 
sequent reports of killings at other sites, where 
chimpanzees had never been provisioned, have 
been attributed to other forms of human inter- 
vention, including habitat loss’. 

Wilson et al. have comprehensively tested 
these two hypotheses by analysing 426 com- 
bined years of research at 18 chimpanzee 
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(Pan troglodytes) study sites, and 92 years of 
research at 4 bonobo (Pan paniscus) study 
sites. The authors assembled information on 
all instances of lethal aggression that have been 
observed directly by researchers, inferred from 
the nature of the victim's injuries or suspected 
on the basis of the circumstances of their 
deaths or disappearances. Coalitional killings 
were documented at 15 of the 18 chimpanzee 
study sites, but there was only one suspected 
killing among bonobos. 

The authors then tested how the frequency 
of killings by chimpanzees was affected by 
several variables linked to human impact, 
including provisioning and habitat distur- 
bance, and a second set of variables related to 
the intensity of resource competition, includ- 
ing the number of males and population 
density. The statistical model that best fits the 
data includes variables linked to the intensity 
of resource competition, rather than those 
linked to human impact. Specifically, the 
authors’ modelling shows that killings occur 
at higher rates in communities that have more 
males and higher population densities. 

These results should finally put an end to the 
idea that lethal aggression in chimpanzees is 
a non-adaptive by-product of anthropogenic 
influences — but they will probably not be 
enough to convince everyone. Perceptions of 
the behaviour of non-human primates, par- 
ticularly chimpanzees, are often distorted by 
ideology and anthropomorphism, which pro- 
duce a predisposition to believe that morally 
desirable features, such as empathy and altru- 
ism, have deep evolutionary roots, whereas 
undesirable features, such as group-level vio- 
lence and sexual coercion, do not. This reflects 
a naive form of biological determinism. Selec- 
tive pressures alter traits as organisms move 
into new environments and confront new 


Figure 1 | A male chimpanzee with fresh wounds following an inter-group attack. 
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challenges and opportunities. The data tell 
us that there are some ecological and demo- 
graphic circumstances in which the benefits of 
lethal aggression exceed the costs for chimpan- 
zees, nothing more. Humans are not destined 
to be warlike because chimpanzees sometimes 
kill their neighbours. 

For those who are persuaded by Wilson and 
colleagues’ evidence, a more interesting set 
of questions emerges. For example, how do 
chimpanzees overcome the collective-action 
problem? By eliminating rival males and 
infants sired by males from other communi- 
ties, chimpanzees gain access to new territo- 
ries and mating partners. But these benefits 
flow to the group as a whole, which creates 
opportunities for free-riding. Although the 
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imbalance-of-power hypothesis relies on the 
odds being in the aggressors’ favour, males 
forgo opportunities to mate and forage while 
they are on patrol, and run at least some risk 
of being injured in attacks. Do males that join 
patrols and lead attacks gain more benefits 
than those that remain in the security of their 
own community's territory? What forces cur- 
tail free-riding? The answers to these ques- 
tions will provide interesting insight into 
the selective forces that favour group-level 
cooperation in species without language, 
social institutions and systems for sanctioning 
free-riders. m 


Joan B. Silk is in the School of Human 
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Giant black hole in 
a stripped galaxy 


An oversized, supermassive black hole has been discovered at the centre of a 
densely packed conglomeration of stars. The finding suggests that the system is 
the stripped nucleus of a once-larger galaxy. SEE LETTER P.398 


AMY E. REINES 


upermassive black holes, which have 

masses millions or even billions of times 

that of our Sun, reside at the centre 
of almost every massive galaxy, includ- 
ing our Milky Way’. There seems to be 
some connection between the evolution 
of galaxies and that of these black holes, 
although the nature of the relationship is 
not well understood. What we do know is 
that, in general, bigger galaxies harbour 
bigger black holes at their centres, and 
these black holes are typically about 0.5% 
of the total mass of a spheroidal galaxy’s 
stars’. But on page 398 of this issue, Seth 
et al.” report the detection of an over- 
sized supermassive black hole that is a 
whopping 18% of the stellar mass of its 
unusual host. 

The dense stellar system in which the 
black hole has been found, M60-UCD1 
(ref. 3), is called an ultra-compact dwarf 
galaxy, and marks a previously unknown 
environment for supermassive black holes. 
Ultra-compact dwarf galaxies are densely 
packed spherical conglomerations of 
stars’. For years, astronomers have debated 
the nature of these objects — are they 
extremely massive star clusters, or are they 
the nuclei of galaxies that have had their 
outer layers stripped off through gravi- 
tational interactions with other galaxies? 


Seth and colleagues present the first clear case 
that an individual ultra-compact dwarf is a 
stripped-galaxy nucleus, because star clusters 
do not host supermassive black holes. 


M60-UCD1—e 


Figure 1 | Dwarfed by its neighbour. This composite 
image, constructed from data from NASA’s Chandra 
X-ray Observatory and Hubble Space Telescope, 

depicts the massive elliptical galaxy M60 and the nearby 
ultra-compact dwarf galaxy M60-UCD1. Seth et al.’ report 
that M60-UCD1 contains a supermassive black hole. 
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Seth et al. ‘weighed’ the black hole by 
determining its gravitational influence on 
nearby stars orbiting it’. To explain the 
observed stellar velocities and the distribu- 
tion of light within M60-UCD1, they had to 
invoke the presence of a central black hole 
with a mass 21 million times that of our Sun. 
A black hole of that size would be expected to 
reside in a host galaxy with a mass of about 
7 billion solar masses. However, Seth et al. 
estimate that M60-UCD1 has a stellar mass of 
only 120 million solar masses. 

Although the discovery of an enormous 
black hole in such a small galaxy is surpris- 
ing, recent work has uncovered a substantial 
number of black holes in other low-mass dwarf 
galaxies’. However, M60-UCD1 is clearly a 
different beast from those — it is far more 
compact and has a much more massive 
black hole. The small black holes in other 
low-mass dwarf galaxies are probably 
similar to the first ‘seeds’ of supermassive 
black holes*. Over cosmic time, such seeds 
grow by swallowing gas and coalesc- 
ing with other black holes during galaxy 
mergers. With a mass 200 times that of 
the smallest nuclear black holes known, 
M60-UCD1’s black hole seems to have 
already grown considerably. 

So how did such a big black hole get 
into such a tiny galaxy? The answer may 
be related to M60-UCD1’s galactic neigh- 
bourhood. This ultra-compact dwarf 
galaxy is right next door to the giant 
elliptical galaxy M60 (Fig. 1). Seth and co- 
workers’ simulations show that 
M60-UCD1 may have formerly been a 
more massive galaxy than it is now (with 
a proportionally sized black hole), but lost 
most of its stars in a gravitational tug of 
war while orbiting its giant neighbour. 
What is left today is the dense stellar 
nucleus and central supermassive black 
hole from the larger progenitor galaxy. 

The evidence for a supermassive black 
hole in M60-UCD1 is strong, but it is 
not the only possible explanation for the 


X-RAY: NASA/CXC/MSU/J. STRADER ET AL.; OPTICAL: NASA/STSCI 


data. Although seemingly unlikely, we cannot 
definitively rule out the existence of a popu- 
lation of low-mass stars or dead stellar 
remnants at the galaxy’s centre that do not 
produce much visible light. An X-ray source 
that might be produced by a supermassive 
black hole has been detected at the heart of 
M60-UCDl1, but this radiation could also 
be generated by the dead remnant of a star’. 
Follow-up observations at radio wavelengths 
could distinguish between these possi- 
bilities”’° and provide further support for the 
presence of a supermassive black hole. 

Seth and colleagues’ discovery is an impor- 
tant step towards understanding the nature 
of ultra-compact dwarf galaxies. Many other 
ultra-compact dwarfs show tantalizing hints 
that they, too, harbour supermassive black 
holes and are therefore stripped galaxy nuclei, 
but direct evidence is lacking. The authors 
are participating in ongoing observing 
programmes that may provide conclusive 
evidence for supermassive black holes in four 
other ultra-compact dwarfs. But at present, 
detecting the gravitational pull of a black 
hole on surrounding stars is feasible for only 
the brightest and closest systems. Attempting 
to detect the direct gravitational signatures 
of black holes in a large population of ultra- 
compact dwarfs must therefore wait for the 
next generation of telescopes. 


If supermassive black holes are indeed 
commonplace in ultra-compact dwarfs, this 
would have major implications for the demo- 
graphics of such black holes — Seth et al. esti- 
mate that there could be more than double 
the number of supermassive black holes in 
the local Universe than is presently thought. 
Although this is possible, it is far from certain. 
Future studies will tell us whether M60-UCD1 
is a fluke, or whether other ultra-compact 
dwarfs are also stripped galactic nuclei that 
host black holes. = 
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Shedding light on 
a change of mind 


Sophisticated genetic tools that make brain cells responsive to light have now 
been used in mice to trigger a memory connected with a particular place, and to 
switch its association from negative to positive, or vice versa. SEE LETTER P.426 


TOMONORI TAKEUCHI 
& RICHARD G. M. MORRIS 


e often believe that our memories 
are accurate, but in fact they can be 
malleable, changing over time as 


recollections become less precise or as events 
that never happened are falsely remembered’. 
There is also another way in which memory 
can change. The memory of a romantic first 
meal out with a partner may take on a differ- 
ent mood when the relationship falters. That 
ofa favourite family beach in summer may be 
destroyed by witnessing a swimming tragedy 
there. In these cases, memory of the place 
remains accurate, but the positive associations 
with that place are lost. On page 426 of this 
issue, Redondo et al.’ investigate the neural 
basis of this selective change. 


Our memories are representations of past 
experiences that are believed to be encoded 
in networks of neurons that fire together or in 
sequence. The representation of a particular 
place — a ‘where’ memory — is encoded in 
a brain structure called the hippocampal for- 
mation, which is embedded within the medial 
temporal lobe. A separate representation in the 
amygdala of the brain encodes a ‘what’ mem- 
ory, which recalls whether one feels good about 
a place (a positive valence) or has marked it off 
as dangerous (a negative valence). These two 
representations are thought to become con- 
nected during learning. The amygdala also has 
direct downstream connections to the action 
and endocrine systems that are involved in 
approach and avoidance’. 

Redondo and colleagues investigated 
the separate representations of ‘where’ and 
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50 Years Ago 


Journey to the Jade Sea. By John 
Hillaby — Books by writers who go 
to Africa in search of their souls are 
always interesting to us who went 
there in search of wages; this book 
is fascinating. “Essentially, I walked 
into the N.ED. for the hell of it” 

(p. 2); Mr. Hillaby took his own hell 
by using a small string of unhealthy 
camels for transport instead of a lorry 
or Land-Rover, as do other people 
in that part of Kenya. “Perhaps all 
safaris start this way. Somewhat 
despondently ...” (p. 7); they do not, 
but I should have been despondent 
if] had started with that collection 
of provisions ... in old cardboard 
boxes ... Messrs. Constable have 
published a most interesting 

book. They might also publish an 
interesting one by the Warden, for 
he no doubt would tell us more 
about the animals and plants. 
From Nature 19 September 1964 


100 Years Ago 


Let us consider lastly a disease 
which collects the last toll from 
one-seventh of humanity, and 
debilitates and enfeebles the lives 
of many whom it does not entirely 
destroy ... How are we organizing 
our campaign against tuberculosis? 
Bacteriology has taught us that 

it is an infectious disease and has 
isolated the organism ... all over 
the civilized world the total death- 
roll of human kind annually from 
tuberculosis probably does not 

fall short of a million souls ... This 
disease must be stopped at its source 
as well as dealt with on its course. 
No disease has ever been eradicated 
from a community by discovering 
cures for it, and none ever will; 
many diseases have disappeared 
because their sources have been cut 
off. Let us be scientific, let us search 
out the truth; having found it, let 

us act upon it, and let us conceal 
nothing that is true. 

From Nature 17 September 1914 
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‘what’? memories, and looked at whether the 
associations between them could be changed. 
To do this, they used several molecular- 
engineering tools, including optogenetics, 
which enables the manipulation of neurons 
in response to light. The authors genetically 
engineered male mice such that, when the 
antibiotic doxycycline was removed from 
their daily diet, a light-sensitive protein called 
channelrhodopsin-2 (ChR2) was able to be 
expressed. Depending on the genetic tools 
used, this took place in neurons of either the 
hippocampus or the amygdala in which the 
gene c-fos was active — a response to neural 
activity and learning. When doxycycline was 
briefly removed from the diet and the animals 
were given either contextual fear conditioning 
(a small electric shock) or reward conditioning 
(interaction with a female mouse), memory 
encoding caused c-fos activation in these brain 
areas and resulted in labelling of the ‘where’ or 
‘what’ memory neurons with ChR2. 

After training, the authors added doxy- 
cycline to the diet of the mice once again, 
preventing any further ChR2 labelling and 
ensuring that only the training memory rep- 
resentations could be activated by blue light. 
Redondo and co-workers then performed a 
place-preference test, in which blue light was 
turned on whenever the mice entered a desig- 
nated target zone. This selectively activated the 
ChR2-labelled neurons and the networks to 
which they had been associatively connected. 
Fear-conditioned mice duly moved away from 
the target zone, because the negative memory 
was optogenetically reactivated when they 
were in this area, whereas reward-conditioned 
mice stayed longer in the target zone, recalling 
the positive memory. 

The key aim of the study was to determine 
whether it is possible to change the ‘what’ 
association linked to a ‘where’ representation. 
The authors were able to do this, but the addi- 
tional conditioning did not involve returning 
the animals to the training arena. Instead, the 
researchers optogenetically reactivated the 
appropriate hippocampal neurons of fear- 
conditioned mice while allowing them access 
to a female. The outcome was a successful 
switch of the original aversive association with 
the target zone into an attractive association. 
A switch from attraction to fear could also be 
achieved. However, the amygdala representa- 
tions of aversion or attraction stayed as they 
were — their downstream connections were 
unchanged. 

These data imply that functional connec- 
tivity between memory representations in 
the hippocampus and the amygdala can be 
altered. Analysing these connections under 
the microscope, Redondo et al. found that 
additional conditioning led to decreases in the 
proportion of hippocampal-activated neurons 
in the amygdala that represented the original 
conditioning. Finally, they observed that ifa 
hippocampal representation was formed by 


fear conditioning, and then had its valence 
changed through subsequent conditioning 
with reward, the animals would later display 
less fear when returned to the original fear- 
conditioning box. 

It has long been apparent that memories 
can be changed from bad to good, or vice 
versa. What is so intriguing about this study 
is that the memory representations associated 
with a place are dissected into their network 
components and, rather than re-exposing the 
animals to the training situation to achieve a 
change, light is used to selectively reactivate 
the representation of the ‘where’ compo- 
nent of a memory and then change its ‘what’ 
association. 

Contemporary theories of learning are less 
about stimuli and responses than about the 
internal representation of events. For example, 
work done in the past 10 years has focused on 
associatively activated event representations in 
learning; that is, on the acquisition of memo- 
ries that can later be evoked by a reminder cue 
of a specific stimulus or by the act of return- 
ing to a particular place’. A key finding of this 
work was that associatively activated event 
representations can successfully substitute for 
the events that created them, with the possibil- 
ity that new learning will successfully associate 
new information to the event memory evoked 
by the reminder cue. This may result in an 
altered response to that cue. 

Optogenetic techniques, used so ingeniously 
by Redondo and colleagues, complement and 
expand on this previous work’ and on the clas- 
sical ‘disconnection approach, which involves 
unilaterally damaging two structures on oppo- 
site sides of the brain to establish the impor- 
tance of their anatomical connectivity for 


ORGANIC CHEMISTRY 


learning”®. The use of ChR2 cell labelling in a 
way that is temporally controlled and depend- 
ent on neural activity, followed by optogenetic 
reactivation of the representation, takes us 
closer to identifying the networks that underlie 
certain forms of memory. 

There are limitations to this optogenetic 
strategy, notably when sequences of neural 
firing are the essence of the memory’ (for 
instance, the memory of a musical tune ora 
sequence of actions). This is because the result- 
ing labelling will represent the sum of all the 
neurons that upregulate the activity-regulated 
genes, such as c-fos, rather than any explicit 
representation of sequence. Through opto- 
genetics and exceptionally careful design of 
behavioural studies, molecular engineering is 
nonetheless shedding light on our understand- 
ing of the underlying physiological networks 
of memory. = 


Tomonori Takeuchi and Richard G. M. 
Morris are at the Centre for Cognitive and 
Neural Systems, University of Edinburgh, 
Edinburgh EH8 9JZ, UK. 

e-mails: tomonori.takeuchi@ed.ac.uk; 
r.g.m.morris@ed.ac.uk 


1. Loftus, E. F. & Palmer, J. C. J. Verbal Learn. Verbal 
Behav. 13, 585-589 (1974). 

2. Redondo, R. L. et al. Nature 513, 426-430 (2014). 

3. LeDoux, J. E. Proc. Nat! Acad. Sci. USA 111, 
2871-2878 (2014). 

4. Saddoris, M. P, Holland, P.C. & Gallagher, M. 
J. Neurosci. 29, 15386-15396 (2009). 

5. Ettlinger, G. Brain 82, 232-250 (1959). 

6. Gaffan, D. & Wilson, C. R. E. Cortex 44, 928-935 
(2008). 

7. Xu, S., Jiang, W., Poo, M.-M. & Dan, Y. Nature 
Neurosci. 15, 449-455 (2012). 


This article was published online on 27 August 2014. 


Reactivity tamed 
one bond at atime 


A catalyst that couples together three reactants to form just one compound out 
of several possibilities, as a single mirror-image isomer, should simplify the 
synthesis of biologically relevant molecules. SEE ARTICLE P.367 


MATTHEW T. VILLAUME & PHIL S. BARAN 


hen the structural complexity of a 
molecule reaches a certain point, 
accessing it in a cheap, high-yield- 
ing and short chemical synthesis can seem 
impossible. As a result, interesting target 
molecules remain on the blackboard, never 
to be properly studied. Organic chemists are 
trying to deal with this by developing meth- 
ods that quickly assemble complex molecules. 
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A process called multicomponent coupling 
is at the forefront of this work, but it requires 
the reactivity of many different molecules to 
be tamed simultaneously, rather like orches- 
trating a three-ring circus. On page 367 of this 
issue, Meng et al.’ report remarkable progress 
in this compelling area of research. 

The chemical synthesis of structurally 
complex molecules, such as high-value com- 
pounds for the pharmaceutical, agricultural 
and materials industries, can be laborious 
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Figure 1 | One catalyst rules all. Meng et al.' report a catalyst that controls the reaction of three 
compounds — a monosubstituted allene, bis(pinacolato)diboron (B,(pin),) and an allylic phosphate 
— to forma single product out of many possible compounds. They used this product as a key 
intermediate in the synthesis of the naturally occurring antibiotic rottnestol. R’ and R” represent 
different attached chemical groups; Et, ethyl; pin, pinacolato (OC(CH;),C(CH;),0). 


and time consuming for even the most skilled 
chemist’. Each bond and group in a molecule 
adds exponentially to the time and care needed 
to produce it. Multicomponent coupling reac- 
tions help to solve this problem by joining 
together more than two reagents sequentially 
in a single reaction vessel’. 

These kinds of reactions have been studied 
for more than 100 years, with early work 
focused on the preparation of heterocyclic 
compounds (rings containing more than 
one kind of atom) through simple ‘condensa- 
tion’ chemistry’. The successful development 
of such reactions is an enormous challenge 
because of the need to balance an extra set of 
equilibria and competing side reactions for 
each new component of the reaction mixture. 
Most of the multicomponent reactions that 
have been developed so far have relied on the 
starting material’s innate tendency to react, 
leaving researchers at the mercy of that com- 
pound’s ‘desires. However, modern organic 
synthesis seeks to use engineered catalysts 
that override natural reactivity in an ordered 
fashion, methodically combining each 
component. 

Decades of catalysis research have brought 
us to the point at which many kinds of 
molecule can be formed with a large excess of 
one of their mirror-image isomers (enantio- 
mers)’. This has been a major focus of modern 
research, because often only one enantiomer 
of a drug will interact with a target enzyme. 
Catalysis offers an ideal solution to producing 
single enantiomers, because the source of 


enantioselectivity — a ligand at the reactive 
metal centre of the catalyst — is used in less 
than stoichiometric quantities. This greatly 
decreases costs. Although the development 
of such catalysts has been impressive, little 
progress has been made in combining multi- 
component coupling reactions with enantio- 
selective catalysis’. 

Meng et al. took this challenge head-on by 
developing a catalyst that promotes two enanti- 
oselective reactions between three compounds: 
a monosubstituted allene; bis(pinacolato) 
diboron (B,(pin),); and an allylic phosphate 
(Fig. 1). Many possible products can form 
from this combination of reagents, and find- 
ing the perfect catalyst to tame the reactivity 
of all three starting materials is a daunting 
challenge. 

On the basis of previous studies from 
the same laboratory’®, the authors knew 
that B,(pin), and the allene can react regio- 
selectively (preferentially at a particular 
atom) and enantioselectively in the presence 
of a catalyst. However, adding an allylic 
phosphate to the reaction mixture created 
several problems for developing a two-step 
process. The catalyst must now differenti- 
ate between the double bonds in the allylic 
phosphate and the allene to perform the first 
transformation correctly. Furthermore, the 
second reaction — called an allylic substitution 
— must not only be enantioselective, but also 
yield linear, rather than branched products 
(for those in the know, it must react through an 
Sy2 rather than an $2’ mechanism). 
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To achieve this, Meng et al. unsurprisingly 
had to perform extensive screening of cata- 
lyst ligands, and apply insight gained from 
experimental and computational mechanistic 
studies. This led to the discovery of a simple, 
copper-based catalyst that works for a wide 
range of substrates, and which provides high 
yields of product with the desired regio- and 
enantioselectivity. 

One of the key advantages of the new 
multicomponent reaction is that the final 
products contain three functionalizable 
handles (groups that can be converted into 
a variety of other chemical motifs). The 
products should therefore be of great utility 
for synthesizing molecules for pharmaceuti- 
cal or agricultural studies. To demonstrate this, 
Meng et al. completed two total syntheses of 
complex natural products using their reac- 
tion as the centrepiece. One of these syntheses 
— that of the antibiotic rottnestol — gave the 
natural product in almost seven times the 
overall yield of the next most efficient syn- 
thesis’. The authors prepared more than one 
gram of both compounds, highlighting the 
fact that their methodology can be reliably 
performed to generate meaningful amounts 
of material®. 

This work is a significant advance for 
transformations that combine enantioselective 
catalysis and multicomponent reactions, 
because it sets the powerful precedent of a 
single catalyst orchestrating the controlled 
union of many building blocks in precisely the 
right way. But as with any breakthrough reac- 
tion, there is still more to be done. The authors’ 
substrates are only monosubstituted allenes 
(which have one group attached to the allene 
core) and disubstituted allylic phosphates 
(two groups attached to the allylic phosphate). 
Extension to more highly substituted starting 
materials should expand the number of acces- 
sible products. In the meantime, the startling 
discovery that a simple catalyst can tame the 
complex reactivity of this daunting set of 
molecules bodes well for future reactions in 
which bonds fall in line. m 


Matthew T. Villaume and Phil S. Baran 
are in the Department of Chemistry, 
Scripps Research Institute, La Jolla, 
California 92037, USA. 

e-mail: pbaran@scripps.edu 


1. Meng, F., McGrath, K. P. & Hoveyda, A. H. Nature 
513, 367-374 (2014). 

2. Keasling, J. D., Mendoza, A. & Baran, P. S. Nature 
492, 188-189 (2012). 

3. de Graaff, C., Ruijter, E. & Orru, R. V.A. Chem. Soc. 
Rev. 41, 3969-4009 (2012). 

4. Joule, J. A. & Mills, K. Heterocyclic Chemistry 5th edn 
(Blackwell, 2010). 

5. Noyori, R. Asymmetric Catalysis in Organic Synthesis 
(Wiley, 1994). 

6. Meng, F., Jang, H., Jung, B. & Hoveyda, A. H. Angew. 
Chem. Int. Edn 52, 5046-5051 (2013). 

7. Czuba, |. R., Zammit, S. & Rizzacasa, M. A. 
Org. Biomol. Chem. 1, 2044-2056 (2003). 

8. Kuttruff, C.A., Eastgate, M. D. & Baran, P. S. 
Nat. Prod. Rep. 31, 419-432 (2014). 


18 SEPTEMBER 2014 | VOL 513 | NATURE | 325 


© 2014 Macmillan Publishers Limited. All rights reserved 


NatuULeiNSIGHT 


EXOPLANETS 


nature iNsIGHT 


Cover illustration 
Nik Spencer 


Editor, Nature 
Philip Campbell 
Publishing 
Richard Hughes 
Production Editor 
Jenny Rooke 
Art Editor 

Nik Spencer 
Sponsorship 
Reya Silao 
Production 

lan Pope 
Marketing 
Steven Hurst 


Editorial Assistant 
Melissa Rose 


The Macmillan Building 

4 Crinan Street 

London N1 9XW, UK 

Tel: +44 (0) 20 7833 4000 
e: nature@nature.com 


f& 


nature publishing group 


18 September 2014 / Vol 513 / Issue No 7518 


in the field will not remember this, but there was a 

period when the search for exoplanets had rather a bad 
reputation, based on a number of high-profile claims that 
were subsequently disproved. Although there was broad 
agreement, even by the 1980s, that planet formation ought 
to bea natural part of the star-formation process, at least 
for low-mass stars, we were still basing our assumptions on 
what we might find using the Solar System as a template. 

On a late-summer morning in 1995, I picked up anew 
manuscript by Michel Mayor and Didier Queloz. After 
reading the paper, I thought, ‘this looks pretty promising: 
My next thoughts were, how could a planet be so close to 
its parent star — it seemed very unlikely that it could form 
there — and was such a planet stable against evaporation 
by stellar radiation? I picked up the phone to a colleague 
knowledgeable in such things, and the second question 
was rapidly answered in the affirmative. The first question 
is still a topic of research, although the two main options 
of disk migration and gravitational scattering emerged 
quite rapidly. 

Nineteen years later, this collection of exoplanet papers 
reviews the state of the field. It is only fitting that Mayor, 
along with his co-authors Christophe Lovis and Nuno 
Santos, provide an overview of where we stand today. 

Most known exoplanets were discovered, through 
planetary transits of the parent star’s disk, by the Kepler 
space telescope. Jack Lissauer, Rebekah Dawson and 
Scott Tremaine assess the highlights of the mission. 

Adam Burrows goes on to look at our current theoretical 
understanding of exoplanets and their atmospheres. 

The path for Kepler was blazed by the under-rated 
Convection, Rotation and Planetary Transits (CoRoT) 
mission, with some help from Microvariability and 
Oscillation of Stars (MOST). Artie Hatzes fills us in on 
what those missions found. 

Finally, the precise instruments needed to measure 
the radial-velocity shifts of stars as they and their 
planets co-orbit the system's centre of mass, along with 
present and future instruments to better characterize the 
planetary atmospheres, are reviewed and anticipated by 
Francesco Pepe, David Ehrenreich and Michael Meyer. 

Thope that you enjoy this collection as much as I have 
enjoyed seeing history pass through my hands at Nature. 


I is hard to imagine now, and the younger people 


Leslie Sage 
Astronomy Editor 


CONTENTS 


REVIEWS 


328 


336 


345 


353 


358 


Doppler spectroscopy as a path to the 
detection of Earth-like planets 

Michel Mayor, Christophe Lovis & Nuno 
C. Santos 


Advances in exoplanet science from 
Kepler 

Jack J. Lissauer, Rebekah |. Dawson & 
Scott Tremaine 


Highlights in the study of exoplanet 
atmospheres 
Adam S. Burrows 


The role of space telescopes in 
the characterization of transiting 
exoplanets 

Artie P Hatzes 


Instrumentation for the detection and 
characterization of exoplanets 
Francesco Pepe, David Ehrenreich & 
Michael R. Meyer 


18 SEPTEMBER 2014 | VOL 513 | NATURE | 327 


© 2014 Macmillan Publishers Limited. All rights reserved 


REVIEW 


doi:10.1038/nature13780 


Doppler spectroscopy as a path to 
the detection of Earth-like planets 


2,3 


Michel Mayor’, Christophe Lovis' & Nuno C. Santos 


Doppler spectroscopy was the first technique used to reveal the existence of extrasolar planetary systems hosted by solar- 
type stars. Radial-velocity surveys led to the detection of a rich population of super-Earths and Neptune-type planets. 
The numerous detected systems revealed a remarkable diversity. Combining Doppler measurements with photometric 
observations of planets transiting their host stars further provides access to the planet bulk density, a first step towards 
comparative exoplanetology. The development of new high-precision spectrographs and space-based facilities will ulti- 
mately lead us to characterize rocky planets in the habitable zone of our close stellar neighbours. 


cal instrumentation and the scientific development of new 

observational techniques made it possible to transform the old 
philosophical concept of ‘plurality of worlds’ in the Universe into an 
active field of modern astrophysics. Today, almost 2,000 planets orbiting 
other stars are known, and we are contemplating an even more exciting 
challenge: discovering Earth-like exoplanets with physical conditions 
suitable for the complex chemistry of life to develop. 

Some of the most important discoveries in this field have been made 
using the technique of Doppler spectroscopy. These results are the focus 
of this Review. They illustrate the tremendous progress that has been 
made in our understanding of exoplanet populations in the Galaxy, and 
the role of the stellar environment in the formation of planetary systems. 

The discovery of a whole new population of planets orbiting other 
stars has now moved the focus of exoplanet researchers to two main 
areas: the search for planets of lower and lower mass, and the precise 
characterization of the new-found planets. In the years to come, the rise 
of a new set of experiments, including ground-based giant telescopes 
and space-based missions dedicated to the detection and characteriza- 
tion of planets hosted by bright stars, will allow the next big steps in 
this research. These efforts will bring us closer to the goal of detecting 
and characterizing Earth-like exoplanets of rocky composition orbiting 
within the habitable zone of their host star. 


D uring the past three decades, the development of astronomi- 


Early history 
How many planets are there in the Milky Way? How many planets are 
similar to Earth? It is interesting to look at the astronomical literature 
of the twentieth century for estimations of the number of planetary sys- 
tems in the Galaxy. Before 1943, the values ranged from zero to, at most, 
a few systems. The formation of protoplanetary gaseous nebulae was 
thought to result from the tidal capture of a stellar envelope through a 
close encounter with another star’. The extremely low probability of such 
a small impact collision was at the origin of these quite pessimistic esti- 
mations of number of planetary systems. In the early 1940s, claims of the 
discovery of several systems” , later found to be false, induced, in a couple 
of years, a complete paradigm shift’. Those estimates jumped to billions 
if not hundreds of billions. It is interesting that such a drastic change of 
thought was the result of spurious detections of planetary systems. 

The use of variation of stellar radial velocity due to gravitational inter- 
action with a massive planet was suggested as a detection method long 


before spectrographs achieved the high precision needed for such detec- 
tions”®. The radial-velocity technique, based on the variable Doppler shift 
of stellar absorption lines, is able to measure planetary orbital period, 
orbital eccentricity and minimum mass (Msini). The amplitude of radial- 
velocity variations depends on the planet mass and orbital distance. In 
the Solar System, Jupiter induces a 12 ms‘ radial-velocity signal on the 
Sun with a periodicity of 12 years, whereas Earth imprints a tiny 0.1 ms" 
signal at a 1-year period. The corresponding Doppler shifts on the stellar 
spectrum are, however, extremely challenging to measure (~10*-10°"° 
of the wavelength), which hampered progress in this field for decades. 

It was only during the 1980s that several ideas and technological solu- 
tions were proposed for new spectrographs, allowing radial-velocity 
precision of a couple of dozen metres per second’. Among the pio- 
neers, credit has to be given to Campbell and Walker’ for their survey 
of around 20 stars. With a hydrogen-fluoride absorption cell in front 
of their spectrograph, they demonstrated a radial-velocity precision of 
the order 15 ms '. However, at the end of many years of monitoring, 
their efforts obtained a negative result: no detection of Jupiter analogues 
orbiting their small stellar sample of solar-type stars*. Another survey 
was initiated by Marcy and Butler’ in 1988 at the Lick Observatory. 
Their iodine absorption cell gave, at that time, a similar precision of 
about 15 ms’. The result, in 1994, of that survey was similar to the 
earlier one: no Jupiter analogues were found around 25 solar-type stars’. 

At the same time as these surveys of limited size, a few teams were 
operating efficient spectrographs of moderate precision (250-500 m s ') 
but on large stellar samples. Among the many thousands of stars sur- 
veyed (mostly the main sequence stars F, G, K and M), a few stars were 
used as standard by the different teams and provided dozens to hun- 
dreds of radial-velocity measurements. When analysing the velocities of 
one of these objects, HD 114762, Latham et al." founda periodic varia- 
tion of 84 days and an amplitude corresponding to a possible companion 
of 11 times the mass of Jupiter (M,), on an eccentric orbit. Combin- 
ing their data with complementary measurements acquired at Haute- 
Provence Observatory allowed the publication of a very precise orbit”. 

Was this companion a planet or low-mass brown dwarf? At that 
epoch, the community was inclined towards the second option — a 
result of its short period, rather large values for the orbital eccen- 
tricity and mass, all characteristics not expected for a gaseous giant 
planet similar to the ones of our Solar System. Based on the present 
observed diversity of detected exoplanets, this consensus is certainly 


1Geneva Observatory, University of Geneva, 51 Chemin des Maillettes, 1290 Versoix, Switzerland. *Centro de Astrofisica e Departamento de Fisica e Astronomia, Faculdade de Ciéncias, Universidade 
do Porto, Rua das Estrelas, 4150-762 Porto, Portugal. “Instituto de Astrofisica e Ciéncias do Espaco, Centro de Astrofisica da Universidade do Porto, Rua das Estrelas, 4150-762 Porto, Portugal. 
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not a definitive conclusion. However, one characteristic should be 
mentioned: HD 114762 is a metal-deficient star (for which metallicity 
[Fe/H] = —0.7, where [Fe/H] = log[A,,/Ay] star — log(A,,/A,,) Sun and A 
is the abundance ofa given chemical element). According to present-day 
observations and to state-of-the-art models of planetary formation, it 
seems difficult to form a massive planet in such a metal-poor environ- 
ment! (see ‘Chemical clues for stars with planets’). For instance, a recent 
high-precision 10-year-survey of more than 100 solar-type stars has not 
revealed one single gas-giant planet with metallicity significantly lower 
than —0.5 (ref. 12). In contrast with planet formation, the formation of 
low-mass stars is not strongly constrained by the metallicity of the star- 
formation environment. These facts suggest that HD 114762b is most 
likely to be a low-mass stellar companion. We should note, however, that 
a few low-mass companions with metallicity close to that of HD 114762 
have been detected’*"’. HD 114762 is the most massive of these outliers. 


The discovery of 51 Pegasi b and its strange properties 

At the beginning of the 1990s, two different approaches were used to 
determine precise stellar radial velocities. On the one hand, spectro- 
graphs with absorption cells in the beam of the spectrograph (hydro- 
gen-fluoride cell or iodine cell)'*”’, and on the other hand, fibre-fed 
spectrographs with simultaneous calibration provided by a thorium 
lamp in a parallel fibre'®. Both methods were aimed at providing a 
precise calibration in wavelength. In 1995, a comparable precision 
(15 ms ') wasachieved by both techniques. However, one positive char- 
acteristic of the double-fibre spectrograph was its ability to obtain the 
final radial-velocity value a few seconds after the end of the observation 
sequence (an achievement not possible at the time for spectrographs 
using the absorption-cell technique). Furthermore, the double-fibre 
technique is more efficient in terms of photon noise, a crucial point to 
allow radial-velocity monitoring ofa large sample of stars with moder- 
ate-sized telescopes. 

In April 1994, with the new ELODIE spectrograph at Haute-Provence 
Observatory”® (using the simultaneous calibration technique), Mayor 
and Queloz” initiated a systematic survey of 142 solar-type stars to 
search for brown dwarfs or giant planets. Included in that sample was 
51 Pegasi, a metal-rich G2V type star, which was found to exhibit a peri- 
odic variation of its velocity with a period as short as 4.2 days. If result- 
ing from the influence of a companion, the observed amplitude would 
indicate a minimum mass of a little less than half the mass of Jupiter’. 
This discovery revealed the first exoplanet hosted by a solar-type star 
and a first example of the family of so-called hot Jupiters. 

Interestingly, such a short period was quite unexpected. The formation 
of gas-giant planets by agglomeration of ice particles was not supposed to 
be possible inside the ‘ice-line’'*. However, soon after this first detection, 
Lin et al.”” showed that short-period gas-giant planets could result from 
the orbital migration of the young planet embedded in the accretion disk. 
This physical process was already described in the literature”, but never 
incorporated in scenarios of planetary system formation. 

Soon after the discovery of 51 Peg b, the detection of several short 
period planets was announced by Butler et al’. Clearly, 51 Peg b was 
not a unique object with exceptional characteristics. 

Despite the run of detections, not all the community was convinced 
that these unexpected objects with short periods were indeed planets. 
However, a few crucial observations played a significant part in con- 
firming their planetary nature. The detection of multi-planetary systems 
was strong evidence supporting the planetary explanation, but the most 
important observation was the first detection ofa planetary transit. 

HD 209458b is a hot Jupiter-like planet with an orbital period close 
to 3.5 days. During the night of the 9 September 1999, at the precise 
time derived from the radial-velocity ephemerides, the first transit of 
the planet was detected, this was followed by a second detection 7 days 
later*’. The same host star was also scrutinized by another team and 
the transit detected. The data also allowed researchers to derive the 
mean density of that gas giant, showing that it was as low as 0.3 g cm’, 
less than half the mean density of Saturn. Observation of the transit 
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was repeated with the Hubble Space Telescope the following year”. The 
amazing precision of that transit is a milestone of exoplanet research. 
Hot Jupiters are indeed real gas-giant planets. Not only did the transit 
curve put an end to alternative interpretations for the existence of hot 
Jupiters, but that observation, with its remarkable precision, also opened 
the door to space experiments dedicated to exoplanetary transits such as 
the Convection, Rotation and Planetary Transits (CoRoT) and Kepler 
missions, and future missions such as Transiting Exoplanet Survey Satel- 
lite (TESS), Characterising Exoplanet Satellite (CHEOPS) and Planetary 
Transits and Oscillations of Stars (PLATO). 

A recent result refined our knowledge of 51 Peg b. Observing high- 
resolution spectra of stars hosting planets, it is possible to detect spectral 
fingerprints of a planet’s atmosphere. Using this technique, significant 
absorption from carbon monoxide and water vapour were observed in 
the dayside atmosphere of 51 Peg b**. In this way, the radial velocity of 
the planet could be measured directly, allowing the determination of the 
planet/star mass ratio and orbital inclination. This gave a direct estimate 
of the mass of 51 Peg b of 0.46 M,. 


An explosion of discoveries 

In 1995, the radial-velocity precision achieved by the best instruments 
was about 15 ms‘. By 1996, improvements in the data-reduction 
software allowed the precision of the iodine-cell technique to be 
improved down to 3 ms‘ using the Hamilton Echelle Spectrometer at 
the Lick Observatory and Keck High Resolution Echelle Spectrometer 
(HIRES)”. The need to increase the precision of Doppler measurements 
is obvious because the amplitude of the radial-velocity wobble is directly 
proportional to the planetary mass. 

Soon after the discovery of 51 Peg b”, existing radial-velocity sur- 
veys of nearby solar-type stars were significantly expanded, and new 
ones started, with precision of 3-10 m s"' (refs 16, 27-31). Additional 
hot-Jupiter discoveries quickly followed, aided by the relative ease of 
detection of their radial-velocity signals”. Over the next two decades, 
several hundred giant exoplanets were found, spanning a wide range of 
mass and orbital distance. 

In 2003, a new gain in precision was achieved with the construction 
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Figure 1 | Exoplanet discoveries as a function of time. The plot shows 

the minimum mass of the planets discovered by radial-velocity surveys as 

a function of discovery epoch. The horizontal lines denote the position of 
Earth, Neptune and Jupiter in this plot. The lower envelope of the points is 
illustrated by the solid line. This plot shows the incredible decrease in mass of 
the discovered planets, reflecting the increasing precision of radial-velocity 
surveys. Earth-mass planets are presently within reach and have been detected. 
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Table 1| Some remarkable planetary systems discovered or characterized with Doppler spectroscopy 


System name Description 


Comments 


51 Pegasi!’ 
u Andromedae 


HD 209458 (refs 23, 24, 
102) 


HD 80606 (refs 103, 104) 
GJ 436 (refs 57, 73) 


1 hot Jupiter 
pO 3 gas giants within 2.5 AU 


1 transiting hot Jupiter 


1 transiting hot Jupiter 


1 transiting Neptune 


u Arae (refs 56, 105) 
55:Cne=e 


1 close-in Neptune, 3 gas giants within 5 AU 


1 transiting super-Earth, 4 gas giants within 6 AU 


HD 189733 (ref. 107) 
HD 149026 (ref. 108) 
HD 69830 (ref. 59) 

GJ 581 (refs 109, 110) 
XO-3 (ref. 111) 

HD 45364 (ref. 39) 

HD 40307 (refs 79, 112) 
GJ 1214 (ref. 71) 

GJ 876 (ref. 113) 


1 transiting hot Jupiter 

1 transiting hot Saturn 

3 Neptunes within 0.6 AU 

At least 2 super-Earths and 1 Neptune 
1 transiting super-Jupiter 

2 gas giants within 0.9 AU 

4 super-Earths within 0.6 AU 

1 transiting mini-Neptune 


1 close-in super-Earth, 2 gas giants and 1 Neptune 
within 0.33 AU 


WASP-8 (ref. 114) 
HD 10180 (ref. 115) 
HD 85512 (ref. 76) 
HD 97658 (ref. 69) 
GJ 3470 (ref. 72) 


1 transiting hot Jupiter 
Up to 7 planets within 3.5 Au, mostly Neptunes 
1 super-Earth at 0.26 AU 


1 transiting super-Earth 
1 transiting Neptune 
a Centauri B® 
GJ 667C”® 


1 short-period Earth-mass planet 


2 super-Earths 


of the High Accuracy Radial Velocity Planet Searcher (HARPS) spectro- 
graph at La Silla Observatory in Chile”. This fibre-fed vacuum spectro- 
graph allows routine precision better than 1 ms‘'. This is still the most 
precise instrument for exoplanet detection. Following this and other 
developments, a large number of systems with planets smaller than Nep- 
tune could be detected (Fig. 1). Especially striking is the continuous 
decrease in the mass of detected exoplanets — an amazing improvement 
from the 3,000 M, (where M, is the mass of Earth) of HD 114762 in 
1989, to the 150 M, of 51 Peg in 1995, down to 1.1 M; for the companion 
of a Centauri B in 2012 (ref. 33). 


Ensemble properties of exoplanets 

After the initial discovery phase, it became possible to derive unbiased 
exoplanet population statistics by quantifying detection efficiencies as a 
function of planet parameters (orbital period, mass and eccentricity). In 
this Review, we summarize the results of various attempts to character- 
ize the ensemble properties of exoplanets using the radial-velocity tech- 
nique. We complement these with an overview of the transit searches 
for hot Jupiters; these have also developed tremendously over the past 
decade. We restrict ourselves to ground-based surveys (see the Review 
by Lissauer et al. on page 336 for results of Kepler mission). 


Statistics of gas-giant planets 

Soon after the discovery of 51 Peg b, it became clear that short-period 
gas giants are relatively rare. Globally, early radial-velocity surveys 
mainly revealed a population of gas giants at orbital distances of 1-5 au™ 
(1 avis the Sun-Earth distance). Key characteristics of this popula- 
tion include**** an overall occurrence rate of about 15% (minimum 
mass Msini > 50 M,, orbital period (P) < 10 years); a mass distribution 
peaking at ~1-2 M, with a ‘brown dwarf desert’ above 10-20 M; a wide 
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First exoplanet found around a solar-type star 
First multi-planet system identified 


First transiting exoplanet found and well suited to atmospheric 
characterization owing to host-star brightness 


Highest known orbital eccentricity (e = 0.93) and misaligned orbit 


First transiting Neptune, close-in but eccentric orbit and orbiting a 
nearby M dwarf 


Dynamically packed system of giant planets with inner low-mass object 


Dynamically packed system of giant planets with inner low-mass and 
intermediate-density object 


Well suited to atmospheric characterization owing to host-star brightness 
Dense giant planet with large heavy element core 

First system of close-in, low-mass planets 

First compact, low-mass system around an M dwarf 

First planet showing a spin-orbit misalignment 

System in 3:2 mean motion resonance 

Compact low-mass system with a potentially habitable planet 

Low-mass, low-density object orbiting a nearby late M star 


Three outer planets locked in a 4:2:1 Laplace resonance similar to the Galilean 
moons of Jupiter 


Retrograde and misaligned orbit 
Most populated exoplanet system known so far 
Potentially habitable planet with a radial-velocity amplitude of 0.7 ms? 
ntermediate-density object orbiting a bright K dwarf 
earby M-dwarf host 


Closest planetary system to the Sun 


Potentially habitable planet 


distribution of orbital eccentricities that differs markedly from the low 
eccentricities seen in the Solar System; a higher metallicity of the host 
stars when compared with the average value found for their field star 
counterparts; and an overall occurrence rate of 1.05 + 0.26% for hot 
Jupiters”””, valid for planets with Msini > 0.1 M, and P < 10 days. 

We note that our knowledge of the period distribution is at present 
limited by the duration of radial-velocity surveys and the sampling of 
long-period signals. We stress the importance of continuing these pro- 
grammes for at least a decade to thoroughly probe the 5-10 au region 
of planetary systems, where the formation of giant planets is likely to 
be most efficient. This is also the region where significant overlaps with 
direct imaging and microlensing techniques are expected. 

In many cases, not one but several giant planets have been found in 
the same system*”*. Various types of dynamical configurations exist, 
from widely separated orbits to strongly interacting mean motion reso- 
nances*’. Owing to the compactness and proximity of such resonances, 
the dynamical stability of several systems is not a priori obvious and 
must to be probed by dedicated numerical integrations. In favourable 
cases, planet-planet interactions are sufficiently strong to affect radial 
velocities in a measurable way (and on orbital timescales), which then 
yields direct constraints on the inclination angles of the orbits and true 
masses of the planets*’. There are some notable examples illustrating the 
diversity of the population of giant planets (Table 1). 

So far, perhaps the most striking result concerning long-period 
giant planets has been their tendency to have high orbital eccentricities 
(median value of about 0.3). The standard scenario of planet forma- 
tion within a protoplanetary disk calls for orbits to be much closer to 
circular. The Solar System has long been seen as a prototypical exam- 
ple of this model. Clearly, the formation and evolution of planetary 
systems is generally much more complex than originally thought. 
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Strong gravitational interactions between giant planets after disk dis- 
sipation*’”, as well as the gravitational influence of bound or passing 
stellar companions”, probably have a crucial role in the evolution and 
the final shaping of planetary systems. In this context, a major question 
that has yet to be answered is how common Solar System analogues are; 
that is, systems whose dynamics are dominated by a massive gas giant 
on alow-eccentricity orbit at several astronomical units from the star. 


Insights from ground -based transit searches 

Soon after the discovery of the first transiting giant planet, 
HD 209458b”*™*, several ground-based efforts started to target the 
population of hot Jupiters through the photometric transit technique. 
Exoplanet transits mainly provide planetary orbital period, inclination 
and planet size (radius). Coupled with radial-velocity measurements, 
which provide the planetary mass, these can be used to derive the planet 
bulk density. This has been the main observational method to constrain 
the internal structure of exoplanets used so far. 

Although early transit-search attempts were plagued by insufficient 
precision and inefficient operations, more recent large-scale surveys 
such as Wide Angle Search for Planets (WASP) and Hungarian Auto- 
mated Telescope Network (HATNet)* have eventually unveiled hun- 
dreds of transiting hot Jupiters orbiting FGK dwarfs within about 200 pc 
of the Sun. Two key properties of this population are a planet-size distri- 
bution that shows an excess of anomalously large radii, hinting at an as 
yet poorly understood physical mechanism that injects or traps excess 
internal heat inside the planet“; and the existence of a sub-population 
of hot Jupiters whose orbital plane is misaligned with respect to the stel- 
lar equatorial plane”' (Fig. 2), preferentially found around hotter stars 
(with effective temperature (T.,) of more than 6,250 K)™. 

The formation and evolution of hot-Jupiter systems is a matter of 
active research and a full picture is still missing. For a long time the 
canonical scenario of inward migration within a protoplanetary disk 
prevailed, but the discovery of misaligned hot Jupiters has significantly 
changed this. It now seems clear that dynamical interactions between 
multiple giant planets, and/or interactions with massive outer com- 
panions (for example, Kozai oscillations), have a major role during or 
after planet formation”. This echoes the conclusions already drawn 
from the observed high eccentricities of longer-period giant planets. In 
those scenarios, planets are perturbed into orbits with high eccentricity 
and potentially high inclination. These are then circularized and rea- 
ligned with the star through tidal interactions between the planet and 
the star. Indeed, the convergence between observed spin-orbit align- 
ment and the existence of an outer stellar convection zone for cool stars 
(T¢¢< 6,250 K) points towards the influence of tides on the orbital evolu- 
tion of at least some of the known hot Jupiters. 


The rise of Neptunes and super-Earths 

Improvements in radial-velocity precision towards 1 ms’, coupled with 
dedicated observing strategies, opened a new search space for radial- 
velocity surveys. In 2004, three planets in the Neptune-mass range were 
found for the first time: p Arae c, GJ 436b and 55 Cnc ee In 2006, 
HD 69830 was found to be the first system of multiple low-mass planets 
on close-in orbits, an architecture that would later prove to be com- 
mon”. By 2008, it had become clear from the HARPS survey” that a 
large population of Neptunes and super-Earths exists on short-period 
orbits”, with a preliminary occurrence rate of 30% for such planets 
(Msini < 30 M,, P< 50 days). In 2011, early results from the space-based 
Kepler mission fully confirmed this picture and greatly expanded the 
landscape of small-planet studies®'. (The Kepler results are discussed 
in the Review by Lissauer et al. on page 336.) 

Only three radial-velocity programmes discovered a sufficient num- 
ber of low-mass planets (Msini < 30 M,) to allow meaningful statistical 
studies of this population. These are the HARPS survey for FGK stars 
(44 detections”), the HARPS survey of M dwarfs (10 detections”), 
and the Keck-HIRES survey of FGK stars (9 detections™). Key proper- 
ties of the low-mass population unveiled by the HARPS surveys can 
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Figure 2 | Rossiter-McLaughlin effect for probing the spin-orbit 
alignment of exoplanets. For transiting exoplanets, precise radial-velocity 
measurements are not only able to measure exoplanet mass, but also the sky- 
projected angle between the planet's orbital plane and the stellar equatorial 
plane by the Rossiter-McLaughlin (RM) effect. The technique relies on the 
selective occultation of approaching and receeding parts of the rotating stellar 
disk by the planet during transit, causing variable net Doppler shifts that are 
detectable as deformations of the stellar spectral lines. The RM effect has a 
duration equal to the transit duration (typically hours) and is superimposed 
onto the usually larger radial-velocity signal caused by the gravitational pull 
of the planet on its star, which has periodicity equal to the planet orbital 
period (typically days for hot Jupiters). Two examples obtained with the 
HARPS spectrograph are shown. a, The exoplanet HD 189733A b exhibits a 
symmetric RM effect with a positive-then-negative radial-velocity anomaly 
that is the signature of an aligned and prograde star-planet system'’*. b, The 
WASP-8A system, however, shows a clearly asymmetric RM effect caused by a 
planet on a retrograde and strongly misaligned orbit with respect to the stellar 
rotation axis’. 
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be summarized as follows: for FGK stars, a global occurrence rate of 
0.33 + 0.05 planets per star for Msini between 3 M,, and 30 M, and 
P.<50 days; a mass distribution showing a steep rise below 20 M,, with 
a continuously rising trend at least down to 5 M,; a multiplicity rate of 
at least 70% among systems with at least one Neptune or super-Earth, 
that is, most low-mass planets are found in multi-planet systems; for M 
dwarfs, a global occurrence rate of 0.40 + 0.11 planet per star for Msini 
between 3 M,,and 30 M, and P < 50 days; and an occurrence rate of 
super-Earths (1-10 M,) in the habitable zone of M dwarfs of 0.415333 
planets per star. 

The high occurrence rates have one key consequence: systems of 
multiple planets with masses between 1 M, and 20 M, orbiting within 
0.5-1.0 au represent the most common type of planetary systems in the 
Galaxy. This result is supported by planet population synthesis models". 
Moreover, such systems often exhibit a packed dynamical configura- 
tion, with little space left for stable orbits between consecutive planets 
(see Table 1 for examples of such compact systems). Planet formation 
and evolution scenarios must now account for the existence of this 
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population. Competing theories include convergent migration of sev- 
eral protoplanets within a disk towards the inner regions of the system™, 
and in situ formation of super-Earths or Neptunes within a disk that is 
significantly more massive than the Minimum Mass Solar Nebula”. 

Essential clues on the formation path of these planets will come from 
the study of their internal structure. A key question is whether vola- 
tiles (mainly water) are a significant constituent of the interiors, which 
would point to a formation beyond the ice line. Another open ques- 
tion is to what extent the prevalence of H/He envelopes is a function of 
planet mass, formation path and irradiation. These issues can be inves- 
tigated by discovering super-Earths and Neptunes transiting nearby 
bright stars; a precise mass and radius can be obtained for these planets 
and atmospheric composition can be studied. At present, there are six 
such planets (55 Cnc e668 HD 97658b™”", GJ 1214b”, GJ 3470b”, 
GJ 436b*””*, and HAT-P-11b”) with radius (R) < 6 R,, all of them dis- 
covered from the ground. The space missions CoRoT and Kepler have 
also provided a sample of objects with precisely-measured densities 
(see Review by Lissauer et al. on page 336). However, these targets are 
generally much more distant than those discovered by radial-velocity 
surveys, and therefore more difficult to characterize. 

Although still limited, the sample of low-mass planets with well- 
measured densities already shows a wide diversity of compositions 
(Fig. 3 gives an overview of our present knowledge of Neptunes and 
super-Earth mean densities; see ref. 75 for densities obtained from the 
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Figure 3 | Mass-density diagram for Neptunes and super-Earths. Few 
low-mass exoplanets have precise mass and radius measurements from which 
a reliable density can be derived. Here we show the mass and density of the 

21 Neptunes and super-Earths that have a mass measurement with better 

than 20% precision. A population of low-density objects can be seen below 
around 2.0 g cm”, indicating a substantial H/He envelope much like Uranus 
and Neptune. Another population of much denser objects is also revealed, 
indicating bulk compositions ranging from terrestrial to more volatile-rich 
(for example, H,O). The overall trend indicates lower densities towards higher 
masses. However, high-density objects seem to exist also at masses above 

10 M,, while low-density mini-Neptunes occur at masses of only a few Earth 
masses (M,). The planets GJ 1214b, HD 97658b and 55 Cnc e span a narrow 
mass range of 6-8 M, and have mean densities from 1.6 gcm™ (GJ 1214b) to 
almost 5 g cm‘ (55 Cnc e), indicating very different internal structures at a 
given mass. This hints at a complex mass-radius relationship for low-mass 
exoplanets that does not depend on mass (or radius) alone, but also on 
environmental effects related to the formation and evolution of planetary 
systems. Mass-density relations from internal structure models with various 
bulk compositions are superimposed onto the observations'”. For clarity, only 
a selection of the exoplanets are labelled. 
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Kepler results and radial-velocity follow-up). 

Finally, radial-velocity surveys have recently come tantalizingly close 
to discovering planets in the habitable zone. The GJ 581 and HD 85512 
systems comprise at least one super-Earth that could lie at the edge of 
habitability, depending on surface conditions””’. Moreover, the planets 
GJ 667C c and HD 40307¢ are located within the classical habitable 
zone’**°. However, because neither the bulk density nor the atmos- 
pheric composition of these worlds is known, all discussions about their 
habitability remain largely speculative. 

The discovery of such objects has been possible thanks to sub-metre- 
per-second radial-velocity precision and a careful analysis of the radial- 
velocity time series. At this level of precision, however, various physical 
phenomena in stellar photospheres contribute significant signals that 
can hide planetary radial-velocity signatures if not properly modelled. 
Solar-type stars have an outer convective envelope that exhibits vari- 
ability on different timescales. Granulation, magnetic features (such as 
cool spots, plages and faculae) and long-term activity cycles all induce 
radial-velocity variability at the metre-per-second level***. Under- 
standing how to diagnose and correct these effects is an active area 
of research **. State-of-the-art instrumentation and dense temporal 
sampling are the key to making progress in this field, and to ultimately 
push radial-velocity sensitivity down to the 10 cms" level. This is the 
realm of habitable, Earth-mass planets around solar-type stars. Con- 
sidering that HARPS has already detected planetary signals with an 
amplitude of 50 cms‘ (ref. 32), and that modelling of stellar signals is 
still in its infancy, the exploration of the habitable zone around nearby 
FGK and M dwarfs is within reach of the radial-velocity technique. That 
is the main goal of future Doppler instruments (for example, the Echelle 
Spectrograph for Rocky Exoplanet and Stable Spectroscopic Observa- 
tions (ESPRESSO) on the European Southern Observatory Very Large 
Telescope (ESO-VLT), see Review by Pepe et al. on page 358). 


Chemical clues for stars with planets 

One of the most crucial pieces of information to understand the prop- 
erties of the discovered planets and to access their formation process 
comes from the study of planet host stars. On the one hand, precise 
stellar parameters, such as the radius, are crucial if we want to measure 
precise values for the radius of a transiting planet**. On the other hand, 
the chemical composition ofa planet, both its interior and atmosphere, 
is also likely to be related to the chemical composition of the protostellar 
cloud, reflected in the composition of the stellar atmosphere”. The pre- 
cise derivation of stellar chemical abundances thus provides important 
clues to understanding the planets and their observed properties. 

Furthermore, a number of studies have pointed towards the existence 
of a strong relationship between the properties and frequency of the 
new-found planets and those of their host stars. Large spectroscopic 
studies**” confirmed initial suspicions of a positive correlation between 
the probability of finding a giant planet and the metal content of the 
stars (Fig. 4). Curiously, this strong metallicity-giant-planet correlation 
was not found for the lowest mass planets”””’. 

It was soon realized that this correlation for giant planets was a key 
aspect of understanding planet formation. The simple existence of such 
a correlation has been pointed out as a strong evidence for giant-planet 
formation through the core-accretion process”. The lack of correlation 
for the lower-mass planets is also in full agreement with the expectations 
from such models. Indeed, stars formed out of metal-rich clouds are 
expected to have a higher mass of solid elements in their protoplan- 
etary disks, thus leading to the formation of planet cores over a short 
timescale. These are then able to accrete gas and become giant planets. 
However, stars formed out of metal-poor clouds will not have much 
planet-forming material in their disks. The planets will grow slowly, 
never achieving enough mass to become giant planets. 

Recent results from a specific survey for giant planets orbiting a sam- 
ple of metal-poor stars was conducted with the Doppler velocimetry 
technique, using the HARPS spectrograph”. The results fully confirm 
the lower frequency of giant planets orbiting lower metallicity stars, 
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Figure 4| Metallicity distribution of planet-hosting stars. In the left panel, 
the frequency of giant planets as a function of stellar metallicity is shown 
based on results from the HARPS planet search programme. The dashed 

line shows a power-law fit to the histogram values. In the right panel, we 
present the same plot but for stars that host only Neptune- or super-Earth-like 
planets. By metallicity we denote the abundance (A) of iron relative to the 
Sun, [Fe/H] = log(A,,/Ay) star—log(A;./Ay) Sun. These plots show a clear 
correlation between the presence of giant planets and the metallicity of the 
star. This trend is not seen for stars hosting lower-mass planets (as in ref. 90). 


and point to a possible limit in metallicity, below which no giant planets 
may be formed. 

Further evidence for the planet-formation process comes from the 
study of specific elemental abundances. It is now becoming clear that 
the abundance of a elements plays an important part in the formation 
of planetary systems, particularly in metal-poor environments”. The 
role of the abundances of other elements is also under discussion; some 
curious trends are a matter of strong debate, for example the abundances 
of the light element lithium?>"’, its isotope lithium -6 (refs 95, 96) or 
other elements”. 

It is important to note that the role of stellar metallicity in the for- 
mation of different architectures of planetary systems has also been 
addressed. Recently, suspicions have been raised concerning the influ- 
ence of stellar metallicity on the orbital period of planets””’* — planets 
orbiting metal-poor stars have longer periods than those in metal-rich 
systems. These results show that metallicity is one of the most crucial 
ingredients in the formation of planetary systems, controlling not only 
the planet-formation efficiency, but also the outcome of the planet- 
formation process, including mass, composition and architecture. 


Future prospects 

How will the field be moved forward? Almost 20 years after the 
discovery of 51 Peg b, the field of exoplanets has reached maturity, 
but our knowledge remains patchy and exoplanet parameter space 
has not been explored systematically. Clearly, the future lies in the 
detection and full characterization of entire planetary systems around 
nearby bright stars, for which precision measurements of both the 
stellar and planetary parameters can be obtained. To this purpose, 
a wealth of ground-based and space-based facilities will be working 
together in a common effort. 

The search for transits of super-Earths and Neptunes around bright 
stars will be carried out by the Next Generation Transit Search (NGTS, 
in 2014), the ongoing MEarth project and other ground-based sur- 
veys; and by the TESS (in 2017), Kepler-K2 (in 2014) and PLATO (by 
2024) missions from space. PLATO in particular will detect trans- 
iting Earth-like planets in the habitable zone of nearby solar-type 
stars. At the same time, radial-velocity surveys using spectrographs 
such as HARPS, HARPS-N, Keck HIRES, Automated Planet Finder 
(APF) at Lick, ESPRESSO (to come into use in 2016), CARMENES 
(to begin work in 2015) and SPiROU (to begin in 2017) will continue 
the thorough exploration of planetary systems in the solar neighbour- 
hood, and will carry out the follow-up of the above-mentioned transit 
search missions to measure planet masses. The CHEOPS mission (in 
2017) will provide essential support to both radial-velocity and transit 


REVIEW 


surveys through a flexible high-precision photometric follow-up from 
space. In addition, the European Space Agency’s ongoing Gaia mis- 
sion will provide high-accuracy fundamental stellar parameters for 
all planet host stars, and, coupled to high-resolution spectroscopy 
from the ground, will greatly improve the achievable precision of plan- 
etary masses and radii. Gaia will also detect giant planets at interme- 
diate semi-major axes, complementing radial-velocity surveys and 
high-contrast imaging in an effort to fully explore planetary systems, 
including dynamically important gas giants on long-period orbits. 
By between 2020 and 2025 the exoplanet landscape will, therefore, 
offer a tantalizing collection of objects spanning the whole param- 
eter space and including terrestrial planets with habitable surface 
conditions. There is a clear path forward for finding the ‘best’ such 
planets — those that are closest to the Sun and most amenable to 
further characterization. Not only will their bulk composition be 
well constrained, but also their atmospheres will be probed with 
techniques such as transmission spectroscopy (primary transit) and 
emission spectroscopy (secondary eclipse). The James Webb Space 
Telescope (launching in 2018) will lead these efforts, complemented 
by high-resolution spectroscopy from the ground with the future 
extremely large telescopes. We are lucky enough to live in a time in 
which humans are, for the first time, contemplating the realistic pos- 
sibility of exploring other planets similar to our own. Whether they 
will be few or plenty, orbiting a Sun-like star or an M dwarf, rocky 
or ocean worlds, with Earth-like atmospheres or more exotic ones, 
remains to be seen. This makes the quest all the more exciting. = 
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Advances in exoplanet 


science from Kepler 


Jack J. Lissauer!, Rebekah I. Dawson? & Scott Tremaine’ 


Numerous telescopes and techniques have been used to find and study extrasolar planets, but none has been more suc- 
cessful than NASA's Kepler space telescope. Kepler has discovered most of the known exoplanets, the smallest planets to 
orbit normal stars and the planets most likely to be similar to Earth. Most importantly, Kepler has provided us with our 
first look at the typical characteristics of planets and planetary systems for planets with sizes as small as, and orbits as 


large as, those of Earth. 


2009 (refs 1, 2). Kepler identifies those exoplanets whose orbits 
appen to appear ‘edge-or by searching for periodic dips caused 
by planetary transits (partial eclipses) of the stellar disks. Above Earth’s 
atmosphere, and in an Earth-trailing heliocentric orbit away from the 
glare and thermal variations of a low Earth orbit, Kepler monitored 
the brightness of more than 10° stars at 30-minute cadence for 4 years. 
Kepler’s unique asset was its unprecedented photometric precision of 
around 30 parts per million for 12th magnitude stars with data binned in 
6.5-hour intervals’. This is the benchmark time interval because Earth 
takes 13 hours to transit the Sun when viewed by a distant observer in 
the ecliptic plane (observers slightly away from the ecliptic view a transit 
of shorter duration). Such high-precision measurements are only pos- 
sible in space (where stars do not twinkle), and are required in order to 
search for Earth-like planets — because the transit of such a planet across 
the disk of a Sun-like star blocks only 80 parts per million of the stellar 
flux. For comparison, the transit of a Jupiter-size planet across a similar 
star blocks 1% of the flux, and this dip is straightforward to detect using 
ground-based telescopes. 

Transits of around 3,600 planet candidates, most of which represent 
true exoplanets, as described below, have been identified in the first 3 years 
of Kepler data (Fig. 1). The discovery of these planets, most of which have 
orbital periods (local ‘years’) shorter than a few Earth months, has greatly 
expanded the collection of known exoplanet types. Most Kepler planets 
have radii, R, that are intermediate between those of Earth and Neptune 
(1-3.8 R,; where R, = 6,371 km, Earth’s radius); planets in this size range 
are missing from our Solar System. These planets have a wide range of 
densities*”, probably because they have atmospheres with a wide range 
of properties. Nonetheless, theoretical models of their interiors’’ imply 
that all of the planets in this class are ‘gas-poor, that is, less than half — in 
most cases much less — of their mass consists of hydrogen and helium 
(H/He). By contrast, H/He make up more than 98% of our Sun's mass as 
well as substantial majorities of the masses of Jupiter, Saturn and almost 
all known giant exoplanets with R, >9R;. 

Kepler's primary mission is to conduct a statistical census of the abun- 
dance of planets as a function of planetary size, orbital period and stellar 
type. Kepler has found that planets are common, with the number of plan- 
ets in the extended solar neighbourhood of our Galaxy being comparable 
with, or larger than, the number of stars". Of particular interest is n,, the 
average number of Earth-like planets per star. ‘Earth-like’ means hav- 
ing a radius similar to that of Earth and receiving about as much energy 


Ke is a 0.95-m aperture space telescope launched by NASA in 


flux from its host star as Earth receives from our Sun; a more precise 
definition is given later. With some extrapolation downward in size and 
longward in orbital period, Kepler data suggest that n; is ~0.1, although, 
as discussed later, there is a broad range of estimates for this value. Earlier 
studies’*”* have found giant planets to be much more common around 
stars that are richer in heavy elements relative to light gases; Kepler data 
have shown that no comparable trend exists for small planets’*"*. Almost 
half of Kepler's planet candidates are in systems in which multiple transit- 
ing planets have been found. As we discuss, the large abundance of such 
systems implies that flat systems containing multiple planets on closely 
spaced orbits are quite common. This finding supports models of planet 
formation within a disk of material orbiting a star first developed by 
Immanuel Kant and Pierre-Simon Laplace. 

Because of Kepler’s success, NASA extended the operations of the 
spacecraft beyond the original baseline plan, but Kepler’s prime mission 
ended in May 2013 with the failure of a second reaction wheel that made 
precise stable pointing away from the spacecraft’s orbital plane impos- 
sible. Nevertheless, data analysis over the next few years is expected to 
reveal hundreds or even thousands of planet candidates in addition 
to the several thousand already discovered, probably including some 
that extend the range of exoplanets to smaller sizes and longer periods 
(Fig. 1, bottom right), and perhaps including true Earth analogues, in 
terms of size and period, that orbit Sun-like stars. These additional plan- 
ets, as well as better estimates of planetary sizes and planet detectability, 
will allow for improved estimates of the population of planets within our 
Galaxy. Although the partially disabled Kepler spacecraft cannot observe 
its original star field any longer, it has been reprogrammed to continue its 
search for other planets, with a focus on those orbiting small stars with 
orbital periods of less than 1 month; Kepler’s new mission is dubbed “K2’ 
Other space missions will continue to expand and exploit Kepler's discov- 
eries over the next decade. The European Space Agency (ESA) launched 
the Gaia astrometric spacecraft in 2013, which will determine precise 
distances to stars hosting planets discovered by Kepler, enabling more 
accurate determination of the sizes of these stars and their associated 
planets. NASAs Transiting Exoplanet Survey Satellite (TESS), scheduled 
to launch in 2017, will conduct an all-sky search for transiting planets 
around the nearest and brightest stars using small-aperture, wide-field 
optics’®. TESS planets will be easier to study with other space- and ground- 
based observatories than Kepler planets, most of which orbit much 
fainter stars. Searches for transiting planets in space will continue in the 
2020s with ESA’s Planetary Transits and Oscillations of Stars (PLATO) 
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mission, which will have an effective aperture almost as large as Kepler 
does, together with a much larger field of view. Analysis of data from 
these advanced observatories, together with associated theoretical studies, 
should advance the studies of exoplanets that were pioneered by Kepler 
far beyond the mission's original goals. 


Transiting planets and eclipsing binary stars 

The transit depth yields the ratio of the planetary radius to the stellar 
radius, and the repetition rate of transits tells us the planet's orbital period. 
The stellar colours — or, better yet, stellar spectrum — can be used to 
deduce the star’s radius and mass, and from these we can find the planet's 
radius and the semi-major axis of its orbit (from Kepler’s third law). In 
favourable cases (generally restricted to close-in planets that are subject 
to intense stellar irradiation), we can detect the occultation of the planet 
as it travels behind the star, and thus determine the planet's albedo (its 
reflectivity). A wide range of albedos are found for both small” and large 
planets’®, with most hot giant planets having low albedo. Planets in mul- 
tiple systems perturb one another through their mutual gravity, causing 
their orbits to deviate from strict periodicity. These deviations lead to 
transit timing variations (TTVs) that, in favourable cases, can be used 


to measure the planetary masses and additional orbital elements””*”". 


The objects in the catalogues assembled by the Kepler project” are 
considered to be only planet candidates because eclipsing binary stars 
can mimic transiting planets. Normally, the fractional brightness change 
in a binary-star eclipse is much larger than in a planetary transit, but 
occasionally the eclipse is grazing, or light from the Kepler-target star 
is diluted or ‘blended’ with the light from a background or companion 
eclipsing binary nearby on the plane of the sky. Such false-positives plague 
ground-based searches for exoplanets, but the Kepler light curves (star- 
light received as a function of time) are of such high quality that they can 
usually be used to discriminate between grazing or blended stellar eclipses 
and planetary transits. Moreover, Kepler is an imaging instrument, which 
can measure changes in the position of the image on the sky plane during 
transit. When multiple sources are present in the aperture used to meas- 
ure the light curve from a Kepler target, the photometric centroid moves 
in response to flux changes in any of the sources. This property allows 
Kepler's pixel-level data to be used to test scenarios in which a planetary 
transit-like event is produced by a diluted background eclipsing binary 
star’. This ‘centroiding’ weeds out most eclipsing binaries that are blended 
with background stars and some that are blended with companion stars. 
Thus, although well under half of Kepler's planet candidates have been 
verified to be true planets, the false-positive rate of the catalogue as a 
whole is probably less than 10%; however, it may exceed 30% for the larg- 
est planet candidates” **. Therefore, with appropriate care, the Kepler 
catalogue can be used for statistical studies of the exoplanet population. 

Most of the planet candidates were found by the automated Kepler data- 
analysis pipeline, but many of the larger candidates were first identified by 
eye, including several dozen using data from the public Kepler archive by 
citizen scientists through the Planet Hunters project”. Other groups have 
found dozens of planets with orbital periods of less than 1 day, for which 
the pipeline is not optimized”. 

Planet candidates that have been verified to be true planets at a high 
level of confidence (in most cases well above 99%) are assigned Kepler 
designations (names). Verification can take the form of dynamical con- 
firmation by detection of either TT Vs in the Kepler light curve or radial- 
velocity variations, or it can be based on statistical arguments showing 
that the likelihood of the planet hypothesis is much greater than that of 
other possible causes of the observed light curve” **. Typically, verified 
systems have been studied in greater detail than unverified candidates, 
resulting in more accurate estimates of stellar and planetary properties. 


Individual planets and planetary systems 
Kepler's primary mission was a statistical characterization of the exoplanet 
population. However, we first describe some of the highlights of the indi- 
vidual planets and planetary systems found. 

Kepler’s first major discovery was the Kepler-9 system, which contains 
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Figure 1 | Orbital period against planetary radius for planetary 
candidates. Coloured symbols represent the number of planetary candidates 
in the system found by analysing the first 3 years of Kepler data. The numbers 
shown above the figure represent the total number of systems of a given 
multiplicity in the catalogue; a small fraction of these planets fall outside the 
boundary of the period—radius ranges plotted. Planets with shorter orbital 
periods are over-represented because geometric factors and frequent transits 
make them easier for Kepler to detect. The upward slope in the lower envelope 
of these points is caused by the difficulty in detecting small planets with 
long orbital periods, for which transits are shallow and few are observed. 
The apparent absence of giant planets in multi-planet systems has been 
quantitatively confirmed’”. R,, radius of Earth; R,, radius of Jupiter; Ry, radius of 
Mars; Ry, radius of Neptune. Data provided by J. Rowe. 


two transiting giant planets with orbital periods (P) of 19.24 days and 
38.91 days. The period ratio (38.91/19.24 = 2.02) is close to the 2:1 orbital 
resonance, which induces TT Vs of tens of minutes in both planets. Analy- 
sis of these TT Vs enabled both planets to be confirmed and provided esti- 
mates of their masses: each planet is similar in size to Saturn but less than 
halfas massive’”*'. TTVs have also been used to discover non-transiting 
planets, such as Kepler-19c™ and Kepler-46c”>. 

The first rocky planet found by Kepler* was Kepler-10b, which 
has an R, of 1.42+0.04 R, and a mass, M,, of 4.64 1.2 M,, where 
M, = 5.978x 10” kg, Earth's mass. This planet's density, 8.8°33 g cm”, is 
consistent with an Earth-like composition. Remarkably, its orbital period 
is only 20 hours. 

Kepler-11 is a Sun-like star with six transiting planets that range in size 
from ~1.8-4.2 R,°”°. Orbital periods of the inner five of these planets are 
between 10 and 47 days, with the ratio of orbital periods between adjacent 
planets ranging from 1.26 to 1.74. For comparison, the ratio of orbital 
periods in the Solar System ranges from 1.63 (Venus and Earth) to 6.3 
(Mars and Jupiter). The outermost planet, Kepler-11g, has a period of 
118.4 days. TT Vs have been used to estimate the planets’ masses. Most, 
ifnot all, havea substantial fraction of their volume occupied by the light 
gases hydrogen (H,) and helium, implying that these gases can domi- 
nate the volume of a planet that is only a few times as massive as Earth. 
Kepler-36 hosts two planets, the semi-major axes of which differ by only 
10%, but whose compositions are markedly different’: rocky Kepler-36b 
has an M, of 4.5 + 0.3 M,, a density of 7.46 *}%5 gcm °andan orbital period 
of 13.84 days, whereas puffy Kepler-36c has an M, of 8.7 + 0.5 M,, a density 
of 0.89 “$07 gcm™ and P = 16.24 days. It is possible that the atmosphere of 
Kepler-36b was stripped by photoevaporation or impact erosion, whereas 
Kepler-36c was able to retain its atmosphere because of its larger core mass 
and slightly larger distance from the host star”’. The proximity of the orbits 
also presents a conundrum: although numerical integrations show that 
the current configuration may be long-lived, most nearby configurations 
are unstable on short timescales”, so it is far from clear how these planets 
arrived at their current orbits. 

The first transiting circumbinary planet to be discovered, Kepler-16b, 
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BOX1 
Kepler’s pre-launch goals 


Kepler’s goals were to explore the structure and diversity of 
extrasolar planetary systems and thereby to: 

@ determine the frequency of Earth-size and larger planets in or 
near the habitable zone of a wide variety of spectral types of stars; 
@ determine the distributions of size and orbital semi-major axes of 
these planets; 

@ estimate the frequency of planets in multiple-star systems; 

@ determine the distributions of semi-major axes, albedo, size, mass 
and density of short-period giant planets; 

@ identify additional members of each photometrically discovered 
planetary system using complementary techniques; 

@ determine the properties of those stars that harbour planetary 
systems. 


is an object of approximately Saturn's mass and radius (M,=106 +5 M,, 
R, = 8.27 + 0.03 R,), travelling on a nearly circular orbit (eccentricity 
e = 0.0069) with a period of 228.8 days around an eclipsing pair of stars 
with an orbital period of 41.08 days®. A bonus in such systems is that 
the planetary transits enable accurate measurements of the stellar masses 
and radii (errors $0.5%): one of the stars is about two-thirds the size and 
mass of our Sun and the other only a fifth as large as the Sun®. Moreover, 
the primary star’s rotation axis has been measured to be aligned with the 
binary’s orbital axis to within 2.4° (ref. 38). Several other circumbinary 
planets have been found using Kepler data, including the multi-planet 
Kepler-47 system”. 

Kepler-20e is the first planet smaller than Earth (R, = 0.87 “015 Ry) 
to be verified around a main-sequence star other than the Sun”; 
its 6.1-day orbit means that it is much too hot to be habitable. The 
low-mass (M-dwarf) star Kepler-42 hosts three planets smaller than 
Earth, the smallest of which is Mars-sized*'. Kepler-37b, only slightly 
larger than Earth’s Moon, is the first planet smaller than Mercury to 
be found orbiting a main-sequence star; its period is 13 days, and 
the stellar host is 80% as massive as the Sun”. KIC 12557548b exhib- 
its transits of varying depths, which might be due to an evaporating 
dusty atmosphere”. Kepler-78b™ has the shortest orbital period of 
any confirmed exoplanet, circling its star in 8.5 hours. This roasting 
world is slightly larger than Earth, and its mass, measured from the 
radial-velocity variations it induces in its nearby host star, implies a 
rocky composition*”*. 

Circumstellar habitable zones are conventionally defined to be the 
distances from stars at which planets with an atmosphere similar to 
that of Earth receive the right amount of stellar radiation to main- 
tain reservoirs of liquid water on their surfaces”’. Kepler-62f is the 
first known exoplanet whose size (1.41 + 0.07 R,) and orbital position 
suggest that it could well be a rocky world with stable liquid water 
at its surface™. 


Principal goals of the Kepler mission 
Discoveries like the ones mentioned have captured a great deal of 
attention in the scientific community and beyond. But Kepler is, in 
essence, a statistical mission, designed to discover large numbers of 
planets in a survey with well-characterized selection criteria. The 
stated goals of the Kepler mission before launch” are shown in Box 1. 
In its 4-year prime mission, Kepler observed almost 200,000 stars, 
including about 140,000 dwarf or main-sequence stars that were 
monitored for a substantial majority of that time. In addition to its 
contribution to exoplanet science, Kepler has revolutionized the field 
of asteroseismology, which probes stellar interiors by observing the 
surface manifestations of oscillations that propagate within stars*”', 
and has dramatically advanced our understanding of eclipsing binary 
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stars”, as well as other areas of stellar physics. This Review only con- 
siders stellar properties indirectly through their contributions to 
assessing planetary characteristics. 


How common are planets? 

The Kepler catalogue of planets is uniquely valuable for studying the 
structure and properties of planetary systems: it is large enough that we 
can map out the distribution of planets in multiple parameters (such as 
orbital period, radius, multiplicity and properties of the host star); it has, 
at least in principle, well-defined selection criteria (in contrast with radial- 
velocity catalogues, which come from many different surveys and which 
usually do not include null results); and most of the parameter space that it 
explores, typical radii of 1-3 R, and orbital periods of up to approximately 
1 year (Fig. 1), is not easily accessible by other techniques. 

One of the most fundamental statistics describing planetary systems is 
the probability distribution f(R,,P)dlnR,dlnP that a member of a speci- 
fied class of stars possesses a planet i in the infinitesimal area element*~* 
dlnR,dinP. The integral of this distribution over a range in planetary 
radius and orbital period is the average number of planets per star (not 
to be confused with the fraction of stars with planets, which is smaller). 
Kepler determines this distribution for Sun-like stars with reasonable 
accuracy for R,2 1 R; in the range P< 50 days, and for R,2 2 R, in the 
range P< 150 days. 

Occurrence rate calculations must carefully account for the complete- 
ness, reliability and threshold criteria of the Kepler catalogue™” °8 as well as 
random and systematic errors in host-star properties. For candidates with 
small transit depths or just a few transits, robust estimates of occurrence 
rates require calibration by injecting and recovering planetary signals in 
Kepler data” ™. The false-positive frequency distribution must be mod- 
elled simultaneously™. Revisions of host-star properties can dramatically 
alter the radius distribution of planets’ " So far, no occurrence rate calcula- 
tions contain all of these ingredients. Here we summarize key results from 
early analyses of the Kepler data set (see ref. 63 for a review). 

The studies cited above find that the number of planets per unit log 
period is nearly flat for R,< 4 R, and P> 10 days, but rises by a factor of 2-5 
between orbital periods of 10 days and 100 days for larger planetary radii. 
The number of planets drops sharply for orbital periods below 10 days for 
R,<4R, and below 2-3 days for giant planets. The occurrence rate of giant 
planets on small orbits is a factor of three lower than in radial-velocity 
surveys***™, perhaps because a significant fraction of giant planets are 
injected into small orbits through planet—planet gravitational interactions, 
and the relatively metal-poor Kepler stars host fewer and/or less massive 
planets, which are less likely to interact strongly. Atall periods the num- 
ber per log radius grows as the radius shrinks, at least down to radii of 2 R,. 
Below 2 R;, the distribution per unit log radius plateaus at orbital periods 
out to 50 days and probably out to 100 days**. The average number of 
planets per star with P< 50 days is ~0.2 for 1 R,<R,<2 R, and ~0.4 for all 
radii R, >1R, 5661 For stars cooler and less massive than the Sun, the aver- 
age number of planets per star is even higher": 0.49 “tos for 1 Ry <R,<2 Ry 
and 0,69 *}(¢ for all radii R, > 1 R, with P<50 days. The higher frequency 
is remarkable given that the fixed period cutoff at 50 days corresponds to 
a smaller semi-major axis in the less massive stars. 

One difficulty in discussing ng (the mean number of Earth-like planets 
per star) is that different authors use different definitions for “Earth-like: 
For solar-type stars the most natural definition is n;=f(1 R,, 1 yr); for 
other stars we can replace P= 1 yr with the period corresponding to the 
same incident stellar flux. Roughly speaking, this is the number of plan- 
ets per star in a range ofa factor of e in radius and period centred on the 
Earth’s radius and period. Unfortunately, determining n,; according to 
this definition requires an extrapolation downwards in size and longward 
in orbital period from the region where Kepler has a statistically reliable 
planet sample, which introduces considerable uncertainty. Applying this 
extrapolation to power-law fits” f(RyP) xP* of the distribution of plan- 
ets in the 16-month Kepler catalogue yields n,, = 0.09. An independent 
analysis of Kepler light curves” gives a consistent result, n,,=0.12 + 0.04, 
after renormalizing by a factor of 2.1 to convert their definition to ours. 
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However, a follow-up study” using the same catalogue, but a more gen- 
eral form for the period and radius distribution, found a much smaller 
value 1; = 0.02 *}8?. There are also other uncertainties: for example, none 
of the results for nj; discussed here model false positives (for planets of 
this size, the biggest contributor is larger planets orbiting faint stars that 
appear close to the Kepler target on the plane of the sky”*). Moreover, 
the extrapolations involve a mix of planets with and without substantial 
volatile envelopes (see below), as well as a likely mix of formation histories, 
and therefore the extrapolation may not capture the true occurrence rate. 

A related number for cool, low-mass stars is 0.155 “$58 planets per star 
with 0.5 R,.<R,< 1.4R, receiving 0.46-1.0 times the solar flux at Earth", 
which corresponds to a rate of 0.26 “}7@ in an interval equal to that which 


we use to define ng. 


The diverse physical properties of Kepler planets 

Kepler has discovered more than 3,000 planet candidates with radii 
R,<4 R;. Planetary interior models show warm planets of this size to 
be ‘gas-poor, defined here as composed of less than 50% H/He by mass. 
Transit surveys are well-suited to studying the physical properties of such 
planets because their radii are very sensitive to small amounts of gas in 
the atmosphere — for example, just 1% H/He added toa 1 M; and 1 Ry 
solid core can inflate the planet to 2 R,°’— and moderately sensitive to the 
bulk composition of the core (for example, water compared with rock). 
Furthermore, the gas-poor planets found by Kepler are valuable because 
they sample a wide range of incident fluxes and therefore were presum- 
ably subject to a wide range of photoevaporation rates; many are found at 
short orbital periods, allowing mass measurements or meaningful upper 
limits through radial-velocity follow-up studies’; and some are found in 
compact systems with multiple transiting, low-density planets, whose 
short orbital periods and large radii allow sensitive mass measurements 
through TTVs. 

Figure 2 shows the masses, radii and incident flux received by well- 
characterized planets less than 20 times as massive as Earth. The wide 
range in size of gas-poor planets of a given mass indicates a diversity of 
composition. It should be noted that most of the sub-Saturn exoplanets 
whose masses and radii are both known are Kepler discoveries. 

Various processes can affect the composition of gas-poor planets, 
including coagulation from volatile-rich or volatile-poor planetesimals, 
accretion of gas from the protoplanetary nebula if the planet forms before 
its dispersal, outgassing of volatiles from the planet's interior, atmospheric 
escape (for example, by photoevaporation), and erosion or enrichment 
of the atmosphere and mantle through collisions with planetesimals. 
Distinguishing which of these processes are at play and their relative 
contributions is both a challenge and a motivation for interpreting the 
measurements from Kepler. 

Several of the planets discussed earlier (highlighted in Fig. 2) have 
served as case studies to illuminate the properties of gas-poor planets. In 
particular, they sample a continuum of photoevaporation rates, which 
are a function of both incident stellar flux (as a proxy for the X-ray/ultra- 
violet radiation responsible for atmospheric erosion) and core mass. The 
middle four planets of Kepler-11, each of which contains ~4-15% H/He 
by mass, might represent the pristine, uneroded initial compositions of 
gas-poor planets, whereas the innermost planet in the system, Kepler-11b, 
is only 0.5% H/He by mass (or maybe devoid of light gases entirely if it 
is water-rich), perhaps because it has undergone considerable mass loss 
from its primordial atmosphere”. Kepler-10b, with an incident flux about 
30 times that of Kepler-11b, may in turn have lost all of its atmosphere to 
photoevaporation. This speculation is based on the density of Kepler-10b, 
which can be matched by theoretical models that do not require a volatile 
component’. In addition to lifetime-integrated X-ray and ultraviolet flux, 
core mass is an important factor in determining the photoevaporation 
rate. A larger core mass for Kepler-36c may have enabled it to maintain 
its atmosphere against photoevaporation, which may have stripped its 
nearby neighbour Kepler-36b”. 

There is now a large collection of gas-poor Kepler planets with masses 
that are individually less precisely measured than the case studies above, 
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but nonetheless statistically powerful when analysed as an ensemble. 
Dozens of masses have been measured through radial-velocity follow- 
up’. Using an approach that accounts for degeneracies between mass 
and eccentricity, more than 100 planetary masses were estimated from 
TTVs”. Furthermore, theoretical models imply that radii of warm planets 
in the size range 2—4 R, depend much more on H/He percentage than 
on total planet mass”. Thus, the larger sample of thousands of Kepler 
candidates with R,<4Rj even iflacking measured masses, informs us of 
planetary occurrence rates as a function of composition and their correla- 
tions with other properties such as the orbital period and the mass and 
chemical composition of the host star, which in turn constrain models 
for the formation and evolution of gas-poor planets. 

Several radius ranges may indicate different regimes for planet for- 
mation and evolution (Fig. 3). For R, $1.6 R,, most of the small number 
of transiting planets that have measured masses are dense enough to 
be rocky; by contrast, larger planets seem to require a volatile compo- 
nent”’. This radius range includes all planets with a period of less than 
1 day”; these planets may once have had atmospheres that have now 
been stripped by impacts, photoevaporation, stellar winds and/or tidal 
forces. For 1.6R,SR,<3R,, the mass-radius relationship is consistent 
with M, & R,, indicating that the typical planetary density decreases 
with increasing size”’””. This mass—radius relationship requires a sub- 
stantial mass fraction of water or a small mass fraction (0.1-5%) ina H/ 
He atmosphere”. The scatter in the mass—radius relationship exceeds 
the measurement errors”, indicating some diversity in composition 
and/or atmospheric properties that possibly includes rare rocky planets 
without voluminous atmospheres. The presence ofa H/He atmosphere 
substantially increases the temperature at the rocky surface; thus plan- 
ets such as Kepler-22b, which has a radius of 2.4 R,”’, are unlikely to 
be habitable. For 3RpSR,S 7R;, the planets are less dense than water, 
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Figure 2 | Mass-radius plot for transiting exoplanets with measured masses 
less than 20 M,, with the model curves for different compositions. Planets 
are colour-coded by the incident bolometric flux (F) they receive in units of 
the flux impinging on Earth. Kepler planets are shown by filled circles, with 
numbers and letters indicating planets discussed in this Review; the rocky 
planets in the crowded region near the lower left include Kepler-10b (red) and 
Kepler-36b (yellow). Other known exoplanets in this mass range are shown 

by open squares. The Solar System planets Venus (V), Earth (E), Uranus (U) 
and Neptune (N) are shown. The lower curve is for an Earth-like composition 
with two-thirds rock and one-third iron by mass. All other curves use thermal 
evolution calculations’”, assuming a volatile atmosphere of H/He or water 
atop a core of rock and iron with a composition the same as that of the bulk 
Earth. The two blue curves are for 50% and 100% water by mass and the two 
orange curves are for H/He atmospheres atop Earth-composition cores. These 
theoretical curves assume a radiation flux 100 times as large as that received by 
Earth and an age of 5 Gyr. Figure courtesy of E. Lopez. 
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implying voluminous H/He atmospheres”. The occurrence rate plum- 
mets between 2 R, and 3 R,°””, so this class is much rarer than the first 
two. Few planets in this class have been found around low-mass stars”’. 
Large planets of R,24R, are more common around stars with larger 
abundances of elements heavier than helium'*””. 

The class of rocky planets with R, $1.6 R, may lack gaseous atmos- 
pheres either because they were unable to accrete significant amounts of 
light gas and never outgassed an atmosphere, or because their primordial 
atmospheres were removed by impacts or photoevaporation. A possible 
explanation for the plunge in occurrence rate between 2 R, and 3 R; is 
that larger planets need H/He envelopes, which are uncommon, but most 
low-density smaller planets contain substantial water components or very 
low-mass outgassed H, envelopes. The paucity of planets larger than 3 Ry 
around low-mass stars and the higher heavy-element abundance in the 
host stars of systems containing planets larger than 4 R, may both reflect 
the difficulty of accreting H/He envelopes in a protoplanetary disk with 
a low surface density in solids. 


Properties of planetary systems 

The Kepler catalogue is particularly important because it contains many 
multiple-planet systems, roughly eight times as many as all radial-velocity 
surveys combined™. Multiple systems are expected to have a very low 
false-positive rate ($1%), because background binary-star eclipses may 
mimic the light curve from a single transiting planet, but are unlikely 
to imitate two or more™”*. Multiple systems are also valuable because 
gravitational interactions among the planets lead to TTVs that in some 
cases allow us to determine the masses and orbital properties of one or 
more of the planets*”*”?*". 

Multiple systems also allow us to constrain the average mutual inclina- 
tions of the planetary orbits, either by comparing relative transit durations 
and orbital periods” or by comparing the frequencies of systems with dif- 
ferent multiplicities in the Kepler survey to those in radial-velocity surveys 
(since the chance that multiple planets in a single system will transit is 
much higher if their mutual inclination is low”). Such studies show that 
the typical mutual inclinations in Kepler planets are only a few degrees, 
similar to those in the Solar System. Another probe is the mutual inclina- 
tion between a transiting planet and its non-transiting perturber, which 
is not biased towards low mutual inclinations by the selection effects that 
are present in multi-transiting systems; the first such systems with good 
constraints have been found to be flat*””*. 

The finding that typical Kepler multi-planet systems are flat is per- 
haps the first direct evidence that most planetary systems formed from a 
rotating thin disk of gas and dust, as suggested by Pierre-Simon Laplace 
more than two centuries ago. But even this widely accepted result leads 
to tension with other observations. In most formation models, plan- 
ets have mean inclinations that are at least half as large as the mean 
eccentricities”’, and this result also holds for the planets in the Solar 
System, the asteroids and the Kuiper belt. Thus, we expect the mean 
eccentricity of the Kepler planets to be no more than about 0.1. Unfor- 
tunately, attempts to measure the eccentricity distribution of Kepler 
planets have been complicated by (and sometimes brought to light) sys- 
tematic uncertainties in the stellar properties”, although individual 
constraints have been possible for a subset of well-characterized stars 
with high signal-to-noise transits**™*. By contrast, the eccentricities of 
radial-velocity planets are straightforward to measure, and the mean 
eccentricity for those having orbital periods larger than 10 days is 0.26, 
much larger than we would expect from the arguments above. Are the 
eccentricities and inclinations of the radial-velocity planets larger than 
those of the Kepler planets? Or perhaps just larger than those of planets 
in Kepler’s multiple planet systems? And if so, why? Could the eccen- 
tricities be over-estimated*’? Or could exoplanets have much larger 
eccentricities than inclinations*”? 

An equally serious tension is revealed by ground-based measure- 
ments of the stellar obliquity, the angle between the equator of the host 
star and the orbital plane of a transiting planet. About 80 obliquities — 
or at least their projections on the sky plane — have been determined, 
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mostly through measurements of the Rossiter-McLaughlin effect”. 
Almost half of the measured projected obliquities exceed 20°, and 15% 
exceed 90°; of course the width of the distribution of true (as opposed 
to projected) obliquities must be even larger. This result is quite differ- 
ent from the expectation for a Laplace-type model, in which the host 
star and planets form from a single rotating gas disk, and thus should 
have a common spin and orbital axis. One possibility is that the close-in 
giant planets arrived on their present orbits through high-eccentricity 
migration, which excites large obliquities**. Another possibility is that 
the stellar spin is misaligned with the axis of the planetary disk, perhaps 
because of a collision with a giant planet on a highly eccentric orbit or 
twisting of the planetary disk by external torques*”*'. To complicate the 
situation further, most of the handful of Kepler planets for which meas- 
urements are available, including multi-planet systems, have obliquities 
near zero"?”?, 

The properties of multi-planet systems are constrained by the require- 
ment that they be dynamically stable over timescales comparable with 
the lifetime of the star. Rigorous stability criteria are not usually available 
except for two-planet systems”, but a useful approximate criterion is that 
systems composed of planets on nearly circular, nearly co-planar orbits are 
stable for N orbits if the separation in semi-major axis between adjacent 
planets, a,,, a, exceeds some constant Ky times the mutual Hill radius 
(the separation at which the mutual planetary gravity equals the difference 
in the pull of the star on the two planets), 


Gove ( M+ Mus i ai +a 

* 
where M,and M,,, are the planet masses and M, is the mass of the host 
star. For N=10"°, Ky+ 9-12 (ref.95). As one would expect, most of the 
Kepler multi-planet systems are safely stable by this criterion (Fig. 4), and 
numerical integrations assuming initially circular, co-planar orbits con- 
firm that almost all of them are stable for at least 10'° orbits”*”*. These 
results depend on the assumed mass-radius relationship, but are relatively 
insensitive to it because the planet mass enters the definition of the Hill 
radius to the 1/3 power. 

A deeper question is whether these systems are dynamically ‘full’ or 
‘packed? which we define to mean that no additional planets, even with 
very small masses, could be inserted between the existing ones ina stable 
orbital configuration. The situation in our own Solar System is ambigu- 
ous: the region from Jupiter to Neptune is packed, or nearly so”, but inside 
of Mercury and between Earth and Mars there are significant bands in 
semi-major axis where additional low-mass planets would be stable for 
at least 10° years”. 

Dynamically packed systems are a natural consequence of models in 
which planets grow hierarchically, since planet growth should stop once 
all of the orbits are stable. However, the correspondence is not exact, since 
stable zones may not be occupied if the planets they contained collided in 
the final stages of hierarchical growth; moreover, the system may contain 
additional planets that are not transiting or fall below Kepler's detection 
threshold. The evidence (Fig. 4) suggests that the known Kepler planets 
are typically separated by about twice the distance required for stability, 
but given the possible presence of undiscovered planets and the desta- 
bilizing effects of non-zero eccentricity, many of these systems may be 
dynamically packed”. 

Planets are also found in binary star systems, either orbiting one of 
the two stars with an orbital period much shorter than the binary period 
(‘S-type’) or orbiting both with a period much longer than the binary 
period (‘P-type or ‘circumbinary’). Most of our understanding of S-type 
planets comes from radial-velocity studies, whereas circumbinary planets 
around normal stars were first discovered by Kepler. In most respects 
the properties of planetary systems in single and binary-star systems are 
similar", although S-type planets seem to be less common in binary 
systems than similar planets around single stars’. 

Binary systems offer unique insights into the formation of both planets 
and stars. Torques from the companion star in an S-type binary can excite 
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slow, large-amplitude Lidov—Kozai oscillations in the eccentricity and 
inclination of the planetary orbit. One striking hint that Lidov-Kozai 
oscillations are sometimes at work is that the four planets with the larg- 
est eccentricities (e > 0.85) are all members of wide S-type binary sys- 
tems'”. A close companion star truncates the protoplanetary disk and 
the planetary system ata radius of about 0.25-0.3 times the companion’s 
separation (for equal-mass stars on a circular orbit"). No S-type planets 
have been discovered, either in radial-velocity or transit surveys, in binary 
systems with separation less than about 10 times the Earth-Sun distance. 
This could mean that the site of planet formation in the protoplanetary 
disk is beyond 3 times (0.3 x 10) the Earth-Sun distance, consistent with 
theories in which planets found at smaller radii have migrated inward; 
alternatively, the outermost regions at which circumstellar orbits are stable 
may nonetheless be too perturbed for planets to form. If binary stars form 
through dynamical interactions between single stars in a dense gas-free 
cluster, it would be difficult for them to acquire circumbinary planets. 
However, if binaries form through fragmentation and collapse in a gas- 
rich environment, they are likely to acquire a circumbinary disk in which 
planets could form. Binary stars with orbital periods of a few days are 
likely to be formed from binaries with much longer periods through high- 
eccentricity migration induced by a tertiary companion”. This process 
would probably remove or destroy any planets initially orbiting one of the 
two stars and is unlikely to produce circumbinary planets detectable by 
Kepler. Therefore, we should not expect to find planets, S-type or P-type, 
in binary systems with periods ofa few days or less, and this expectation 
is so far confirmed by the observations — the shortest-period planet- 
hosting binary star is Kepler-47 with a period of 7.45 days”. 

Just as important as the discoveries made by Kepler are its non-discov- 
eries. So far, Kepler has found no co-orbital planets, which share the same 
average semi-major axis — like the Trojan asteroids found accompany- 
ing Jupiter and the Saturnian satellites Janus and Epimetheus. It has also 
found neither exomoons nor ‘binary’ planets orbiting one another'*™"”’, 


Planet formation 

The mass fraction of stellar material other than the dominant elements of 
hydrogen and helium (the metallicity, in astronomical parlance) ranges 
from a few per cent down to around 0.01% among stars in the solar neigh- 
bourhood. The initial protostellar disk presumably has the same composi- 
tion as its host star, and these heavier elements are the raw material from 
which most of the mass in a typical Kepler planet must be drawn. Thus 
it is natural to expect that planet formation should be easier around stars 
having high metallicity. This expectation is confirmed for the giant planets 
detected in radial-velocity surveys — the fraction of high-metallicity stars 
hosting giant planets is much larger than the fraction of low-metallicity 
stars'*?1°71°8. Similarly, a star in the Kepler catalogue with super-solar 
metallicity is around 2.5 times more likely to host a large planet (R,>5 R,) 
than a star with sub-solar metallicity’. Remarkably, there is no such cor- 
relation for small planets (R,< 2 R,). The probabilities that a Kepler star 
with super-solar or sub-solar metallicity hosts a small planet are approxi- 
mately equal’”, with a significant number of small Kepler planets found 
around stars with metallicity as small as one-quarter that of the Sun“. 
Perhaps this is a hint that the formation process for small planets has more 
than enough metals to draw on even in moderately low-metallicity disks. 
Before any firm conclusions are drawn we need reliable metallicities for 
a larger sample of Kepler stars and planet-frequency measurements for 
stars with a wider range of metallicities. 

One of the most basic questions about the Kepler planets is whether 
they formed in situ’*"”° or whether they migrated to their current orbits 
from larger radii‘. The orbital periods of most of Kepler’s planets 
are $50 days, resulting in nominal formation timescales that are much 
shorter than the lifetime of the gas disk, unlike those of the Solar Sys- 
tem’s terrestrial planets. Thus the planets and their gaseous envelopes 
could have formed in situ. The main argument for migration is that it is 
a robust process’ that inevitably occurs in both analytical models and 
numerical simulations of planets orbiting in gaseous disks’*. However, 
models of migration have not successfully predicted any populations of 
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Figure 3 | Plausible compositions of the small and mid-sized planets 
observed by Kepler. The bars indicate the approximate relative occurrence 
rates °°” of planets in the specified size range. Plausible compositions in each 
size range, in terms of rock, water and light gases, are illustrated above. The 
arrows indicate physical processes that set or alter planet compositions. The 
smallest planets, on the left, can be rocky or possibly mixtures of rock and 
water. Somewhat larger planets have volumetrically significant amounts of 
constituents less dense than rock. Planets whose sizes are comparable to or 
larger than that of Neptune, R,=3.8 R,, have envelopes composed of the lightest 
gases, H, and He. The stars at the bottom of the figure indicate the masses 

and compositions of the stellar types that commonly host planets of the sizes 
indicated, with the half star indicating that although metal-poor stars host a 
significant number of planets in this size range, they are less common hosts. 


planets before they were observed. 

Additional insight into the migration process comes from planets in 
mean-motion resonances. In the strongest of these, the orbital periods 
of the two resonant planets are in the ratio (n + 1):n, where n is an 
integer. Planets can be permanently captured into resonance if they 
cross the resonance during migration and the migration is convergent, 
that is, in a direction such that the period ratio evolves towards unity, 
rather than away. Capture into resonance during convergent migration 
is certain if the migration is slow enough and the planetary eccentricities 
are small enough’. It is therefore striking that the multi-planet sys- 
tems discovered by Kepler contain very few resonant planet pairs. The 
excess fraction of planet pairs in the Kepler sample having period ratios 
within 5-10% of 3:2 or 2:1 is less than 5%. Possible explanations for the 
small fraction of resonant planets include the following: migration was 
too fast for capture to occur (this, however, requires migration times 
of $10° years, much shorter than is plausible with disk migration’); 
stochastic torques on the migrating planet, which might arise in a turbu- 
lent protoplanetary disk"’®, allowed the planets to escape the resonances 
and continue migrating; eccentricity damping from the protoplanetary 


18 SEPTEMBER 2014 | VOL 513 | NATURE | 341 


© 2014 Macmillan Publishers Limited. All rights reserved 


REVIEW 


120 ;- 


100 |- 


10 100 
Separation/Hill radius 

Figure 4 | Separations of nearest neighbours in the Kepler multi-planet 
systems, measured in Hill radii. Masses are derived using the mass—radius 
relationship M,= M; (R,/R,)*, where a = 3 for R,< Ry and a = 2.06 for R, > R;. 
Blue histogram, planet pairs in systems with two known planets; green 
histogram, planet pairs in systems with three known planets; red histogram, 
planet pairs in systems with four or more known planets. Planets on circular, 
co-planar orbits separated by more than 9-12 Hill radii are expected to be stable 
for the lifetime of typical stars; the few planet candidates seen with smaller 
separations may have large errors in their estimated radii, not obey the assumed 
mass-radius relationship or not be planets orbiting the same star. (Data from the 
catalogue of ref. 76.) 
disk led to escape from the resonance’” (this mechanism requires that 
the Kepler planets have very small eccentricities because eccentricities 
are difficult to excite after migration is complete’); and the planets 
might have formed in situ rather than migrating. Although still poorly 
understood, a possible clue to the answer comes from the distribution of 
period ratios in the Kepler multi-planet systems; these are asymmetric 
around the strong 2:1 and 3:2 resonances, with a peak just outside the 
resonance and/or a valley inside’! 


Unsolved problems 
Kepler represents a watershed in our understanding of exoplanets and a 
great stride forward in understanding the properties of planetary systems 
and the problems in developing theories of their formation. But there are 
many aspects of planetary systems that Kepler has not illuminated. Kepler 
has opened up a new region in the orbital period versus radius plane 
(Fig. 1), containing planets as small as Earth's Moon at short periods, and 
larger planets with orbital periods as large as 1-2 years, but all of the plan- 
ets in the Solar System lie outside this region (although only just outside 
in the case of Venus and Earth). The atmospheres of giant planets must be 
investigated through transit observations by ground-based telescopes or 
the Hubble and Spitzer space telescopes, as Kepler has no spectral resolu- 
tion. The eccentricities of planetary orbits provide important insights into 
their formation, but only ground-based radial-velocity surveys can rou- 
tinely measure eccentricities. However, these are expensive because the 
Kepler host stars are so faint, and often impossible with current technol- 
ogy because of the small masses of typical Kepler planets. Kepler cannot 
detect multiple transits of planets with periods longer than a few years, so 
Kepler has not advanced our understanding of the region beyond a few 
times the Earth-Sun distance, where most giant planets are likely to be 
born. Planets are occasionally found at much larger radii and many have 
probably been ejected into interstellar space, but these regions can only be 
investigated by high-resolution imaging and gravitational microlensing. 
Kepler has hugely advanced our understanding of the phenomenology 
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of exoplanets, but so far has led us no closer to a secure theory of planet 
formation. Did the Kepler planets form in situ or did they migrate from 
larger radii? Why are small planets common around host stars with such 
a wide range of metallicities? How did the Kepler planets acquire their 
voluminous atmospheres, and why are the atmospheres so diverse in mass 
even for planets with similar core masses? How are the Kepler planets 
related to the terrestrial planets in the Solar System? Why does the typi- 
cal inclination of the Kepler planets, most of which are small, seem to be 
much less than the typical eccentricity of the radial-velocity planets, most 
of which are large? How are the large angles between some planetary 
orbital planes and the host-star equators generated? What is the relation- 
ship between the dynamics and formation of small, rocky planets and 
gas-giant planets'”'? 


After Kepler 

Although data acquisition by the Kepler spacecraft on its original target 
field has ended, ongoing data analysis and observational follow-up will 
refine the results already obtained and address some of the outstanding 
questions reviewed here. 

Better models of the stellar and instrumental noise in the transit light 
curves, including a rigorous treatment of temporally correlated noise, 
might enable the discovery of smaller, longer-period planets'’””” as well 
as better characterization of existing ones. More accurate transit times are 
crucial given that most Kepler planet masses are derived from TTVs, and 
more secure detections of, and upper limits on, transit duration variations 
(TDVs) will provide important constraints on mutual inclinations. 

We can hope to address some of the unanswered questions about 
occurrence rates and system architectures by more sophisticated sta- 
tistical analyses. The most complete view of the architectures of Kepler 
planetary systems will require tying together constraints from occurrence 
rates, transit durations, TT Vs and TDVs. Even the absence of TTVs can 
provide important constraints on planetary system architectures — for 
example, close-in giant planets seem to have fewer (or no) companion 
planets compared with more distant or smaller planets’. Insight can be 
gained into the composition of gas-poor planets by joint modelling in the 
space of radius, incident flux and, where available, mass (Fig. 2). 

Observational follow-up of Kepler targets from the ground and space 
is underway. High-quality spectra of Kepler host stars will help to refine 
estimates of their properties, and therefore reduce the uncertainty in 
stellar properties, which often dominates the uncertainty in planetary 
properties. A spectrum can also determine the star’s projected rotational 
velocity which, combined with the stellar spin period from Kepler pho- 
tometry, constrains the angle between the stellar equator and planetary 
orbit”. Spectra and adaptive-optics imaging will allow the catalogues to be 
culled more completely of false positives. The determination of accurate 
host-star metallicities, which has already provided new insights into the 
planet-formation process'*”’, will be greatly expanded by the Large Sky 
Area Multi-Object Fibre Spectroscopic Telescope (LAMOST)™. Pro- 
grammes are being developed to follow up some Kepler candidates with 
large TTVs and establish a longer baseline for TDVs using ground-based 
telescopes. Kepler stars are among the billions astrometrically monitored 
by the Gaia mission. Gaia will determine the distances of the Kepler tar- 
get stars, thereby improving our knowledge of stellar parameters and, 
consequently, the planetary radii; better radii will, in turn, improve esti- 
mates of the planetary occurrence rates and compositions, and correla- 
tions between planetary and stellar properties. For stars within around 
200 pc, both within and outside the Kepler field, Gaia can detect Jupiter 
analogues by the astrometric oscillations of their host stars, revealing a 
more complete architecture for systems in which only the close-in planets 
are detectable by transits or radial-velocity measurements. 

Space-based all-sky surveys, including TESS and PLATO (scheduled to 
launch by 2024) could greatly increase the science returns from Kepler by 
revisiting the Kepler field. Such follow-up would provide a long time base- 
line that would allow for improved occurrence rates, including an accurate 
value for n;, masses for longer period planets from TTVs that would help 
to address outstanding issues about their formation and composition, and 
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the possibility of a substantial number of TDV detections. 

Two spacecraft scheduled to be launched this decade will study known 
transiting planets. ESAs Characterising Exoplanets Satellite (CHEOPS) 
will target known exoplanet hosts to discover additional transiting gas- 
poor planets. NASA’s James Webb Space Telescope (JWST) will character- 
ize the atmospheres of gas-poor planets analogous to those that Kepler 
discovered in abundance; this might break the degeneracy among several 
composition possibilities. 

Since the first handful of exoplanet discoveries two decades ago, the 
pace of exoplanet research has been extraordinary’, driven primarily 
by ground-based radial-velocity searches of growing power and sophis- 
tication and by the Kepler mission. The armada of projects described in 
this Review will probe new regions of exoplanet parameter space and 
provide more detailed and accurate probes of the properties of known 
exoplanets. The challenges for the next two decades will be to maintain 
the momentum that has been built up in exoplanet research, and to 
work towards the even longer-term goal of producing an image of an 
extrasolar planet comparable with the iconic images of Earth taken by 
the Apollo astronauts. = 
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Highlights in the study of 
exoplanet atmospheres 


Adam S. Burrows! 


Exoplanets are now being discovered in profusion. To understand their character, however, we require spectral models 
and data. These elements of remote sensing can yield temperatures, compositions and even weather patterns, but only if 
significant improvements in both the parameter retrieval process and measurements are made. Despite heroic efforts to 
garner constraining data on exoplanet atmospheres and dynamics, reliable interpretation has frequently lagged behind 
ambition. [summarize the most productive, and at times novel, methods used to probe exoplanet atmospheres; highlight 
some of the most interesting results obtained; and suggest various broad theoretical topics in which further work could 


pay significant dividends. 


he modern era of exoplanet research started in 1995 with the 

discovery of the planet 51 Pegasi b’, when astronomers detected 

the periodic radial-velocity Doppler wobble in its star, 51 Peg, 
induced by the planet’s nearly circular orbit. With these data, and 
knowledge of the star, the orbital period (P) and semi-major axis (a) 
could be derived, and the planet’s mass constrained. However, the incli- 
nation of the planet’s orbit was unknown and, therefore, only a lower 
limit to its mass could be determined. With a lower limit of 0.47 M, 
(where M, is the mass of Jupiter) and given its proximity to its primary 
(ais about 0.052 au, 1 au being the Earth—Sun distance; one hundred 
times closer to its star than Jupiter is to the Sun), the induced Doppler 
wobble is optimal for detection by the radial-velocity technique. The 
question was how such a ‘hot Jupiter’ could exist and survive. Although 
its survival is now understood (see “Winds from planets’), the reason 
for its close orbital position is still a subject of vigorous debate. Never- 
theless, such close-in giants are selected for using the radial-velocity 
technique and soon scores, then hundreds, of such gas giants were 
discovered in this manner. 

However, aside from a limit on planet mass, and the inference that 
proximity to its star leads to a hot (1,000-2,000 kelvin (K)) irradiated 
atmosphere, no useful physical information on such planets was avail- 
able with which to study planet structure, their atmospheres or com- 
position. A breakthrough along the path to characterization, and the 
establishment of mature exoplanet science, occurred with the discovery 
of giant planets, still close-in, that transit the disk of their parent star. 
The chance ofa transit is larger if the planet is close, and HD 209458b, 
which is about 0.05 au from its star, was the first to be found’. Optical 
measurements yielded a radius for HD 209458b of about 1.36 R, where 
R, is the radius of Jupiter. Jupiter is roughly ten times, and Neptune 
is roughly four times, the radius of Earth (R,). Since then, hundreds 
of transiting giants have been discovered using ground-based facili- 
ties. The magnitude of the attendant diminution ofa star’s light during 
such a primary transit (eclipse) by a planet is the ratio of their areas 
(the transit depth, RR, where R, is the planet’s radius and R. is the 
star’s radius), so with knowledge of the star’s radius, the planet’s radius 
can be determined. Along with radial-velocity data, because the orbital 
inclination of a planet in transit is known, one then has a radius—mass 
pair with which to do some science. The transit depth ofa giant passing 
in front of a Sun-like star is about 1%, and such a large magnitude can 
easily be measured with small telescopes from the ground. A smaller, 


Earth-like planet requires the ability to measure transit depths 100 times 
more precisely. It was not long before many hundreds of gas giants were 
detected both in transit and by the radial-velocity method, the former 
requiring modest equipment and the latter requiring larger telescopes 
with state-of-the-art spectrometers with which to measure the small 
stellar wobbles. Both techniques favour close-in giants, so for many 
years these objects dominated the bestiary of known exoplanets. 

Better photometric precision near or below one part in 10*-10°, 
which is achievable only from space, is necessary to detect the tran- 
sits of Earth-like and Neptune-like exoplanets across Sun-like stars, 
and, with the advent of Kepler’ and CoRoT (Convection, Rotation and 
Planetary Transits)*, astronomers have now discovered a few thousand 
exoplanet candidates. Kepler in particular revealed that most planets 
are smaller than about 2.5 R, (four times smaller than Jupiter), but fewer 
than around 100 of the Kepler candidates are close enough to us to 
be measured with state-of-the-art radial-velocity techniques. Without 
masses, structural and bulk compositional inferences are problematic. 
Moreover, most of these finds are too distant for photometric or spec- 
troscopic follow-up from the ground or space to provide thermal and 
compositional information. 

A handful of the Kepler and CoRoT exoplanets, and many of the 
transiting giants and ‘sub-Neptunes’ discovered using ground-based 
techniques, are not very distant and have been photometrically and 
spectroscopically followed up using both ground-based and space- 
based assets to help to constrain their atmospheric properties. In this 
way, and with enough photons, some information on atmospheric com- 
positions and temperatures has been revealed for around 50 exoplanets, 
mostly giants. However, even these data are often sparse and ambigu- 
ous, rendering most such hard-won results provisional’. The nearby 
systems hosting larger transiting planets around smaller stars are the 
best targets for a programme of remote sensing to be undertaken, but 
such systems are a small subset of the thousands of exoplanets currently 
in the catalogues. 

One method by which astronomers are performing such studies is by 
measuring the transit radius as a function of wavelength®*. Because the 
opacity of molecules and atoms in a planet’s atmosphere is a function 
of wavelength, the apparent size of the planet is also a function of wave- 
length — in a manner that is characteristic of atmospheric composition. 
Such a ‘radius spectruny can reveal the atmosphere’s composition near 
the planet terminators, but the magnitude of the associated variation is 
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down from the average transit depth by a factor of around 2H/R,, where 
His the atmospheric scale height (a function of average temperature 
and gravity). This ratio can be ~0.1 to 0.01, correspondingly making it 
more difficult to determine a transit radius spectrum. Only telescopes 
such as the Spitzer Space Telescope’, the Hubble Space Telescope and 
the largest ground-based telescopes with advanced spectrometers are 
up to the task, and even then the results can be difficult to interpret. 

Another method probes the atmospheres of transiting exoplanets 
at secondary eclipse, when the star occults the planet about 180° out 
of phase with the primary transit. The abrupt difference between the 
summed spectrum of planet and star just before and during the eclipse 
of the planet by the star is the planet’s spectrum at full face. Second- 
ary eclipse spectra include reflected (mostly in the optical and near- 
ultraviolet region) and thermally emitted (mostly in the near- and 
mid-infrared region) light, and models are necessary to distinguish, if 
possible, the two components. It should be noted that separate images of 
the planet and star are not obtained by this technique, and a planet must 
be transiting. With few exceptions, when the planet does not transit, the 
summed light of a planet and star varies too slowly and smoothly for 
such a variation to be easily distinguished from the systematic uncer- 
tainties of the instruments to reveal the planet’s emissions as a function 
of orbital phase. For the close-in transiting hot Jupiters, the planet flux 
in the near-infrared is 10° times the stellar flux — much higher than the 
ratio expected for the class of planet in a wide orbit that can be separated 
from its primary star by high-contrast imaging techniques (see ‘High- 
contrast imaging’). In cases when such high-contrast direct imaging is 
feasible, the planet is farther away from the star (hence, dim) and dif- 
ficult to discern from under the stellar glare. However, hot, young giants 
can be self-luminous enough to be captured by current high-contrast 
imaging techniques, and a handful of young giant planets have been 
discovered and characterized by this technique. More are expected as 
the technology matures'”". 

The secondary eclipse and primary transit methods used to deter- 
mine or constrain atmospheric compositions and temperatures (as well 
as other properties) generally involve low-resolution spectra with large 
systematic and statistical errors. These methods are complementary 
in that transit spectra reliably reveal the presence of molecular and 
atomic features, and are an indirect measure of temperature through 
the pressure-scale height, whereas the flux levels of secondary eclipse 
spectra scale directly with temperature, but could in fact be featureless 
for an isothermal atmosphere. The theoretical spectra with which they 
are compared in order to extract parameter values are also imperfect, 
and this results in less trustworthy information than one would like. 
Giant planets (and ‘Neptunes’) orbiting closely around nearby stars 
are the easiest targets, and are the stepping stones to ‘Earths. Second- 
ary and primary transit spectral measurements of Earth-like planets 
around Sun-like stars, as well as direct high-contrast imaging of such 
small planets, are not currently feasible. However, measurements of 
exo-Earths around smaller M-dwarf stars might be, if suitable systems 
can be found. Nevertheless, with a few score of transit and secondary 
eclipse spectra, some planetary phase light curves, a few high-contrast 
campaigns and measurements, and some narrow-band, but very high 
spectral resolution measurements using large telescopes, the first gen- 
eration of exoplanet-atmosphere studies has begun. 

There are several helpful reviews on the theory of exoplanet atmos- 
pheres’*”. Added to these, there are informed discussions on the 
molecular spectroscopy and opacities that are central to model build- 
ing” ’’. Monographs on the relevant thermochemistry and abundances 
have been published over the years”. In this Review, I do not attempt 
to cover the literature of detections and claims, nor do I attempt to 
review the thermochemical, spectroscopic or dynamical modelling 
efforts so far. Instead, I focus on those few results concerning exoplanet 
atmospheres that to my mind stand out, that seem most robust and that 
collectively summarize what we have truly learned. I present, of neces- 
sity, only a small subset of the published literature, and no doubt some 
compelling results have been neglected for lack of space. In addition, I 
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touch on only the basics of the atmosphere theory applied so far, pre- 
ferring to focus, when possible, on the progress in theory that is neces- 
sary for the next generation of exoplanet-atmosphere studies to evolve 
productively. I embark on a discussion of what I deem to bea few of the 
milestone observational papers in core topics; these might be considered 
to constitute the spine of progress in recent exoplanet-atmosphere study. 
I accompany each with a short discussion of the associated theoretical 
challenges posed by the data. 


Transit detection of atoms and molecules 

The apparent transit radius of a planet with a gaseous atmosphere 
is the impact parameter of a ray of stellar light for which the opti- 
cal depth at that wavelength (A) is of order unity. It should be noted 
that at that level the corresponding radial optical depth, which if in 
absorption is relevant to emission spectra at secondary eclipse, will 
be much smaller. Because an atmosphere has a thickness (extent), 
and because absorption and scattering cross-sections are functions of 
photon wavelength that in combination with the air column constitute 
optical depth, the measured transit radius is a function of wavelength. 
Therefore, measurements of a planet’s transit depths at many wave- 
lengths of light reveal its atomic and molecular composition. A good 
approximation for this is given by™: 


dR p/din(A) ~ H din o(A)/din(A) (1) 


where o(A) is the composition-weighted total cross-section and the scale 
height, H, is kT/ug, where gis the planet's surface gravity, tis the mean 
molecular weight, Tis an average atmospheric temperature, and k is 
Boltzmann's constant. H sets the scale of the magnitude of potential 
fluctuations of R, with i, and o(A) is determined mostly by the atomic 
and molecular species in the atmosphere. 

Charbonneau et al.”* were the first to successfully use this technique 
with the 4-c measurement of atomic sodium in the atmosphere of 
HD 209458b. Along with HD 189733b, this nearby giant planet has been 
the most photometrically and spectroscopically scrutinized. Since then, 
Sing et al.*° have detected potassium in XO-2b and Pont et al.***” have 
detected both sodium and potassium in HD 189733b. These are all opti- 
cal measurements at and around the sodium D doublet (about 0.589 um) 
and the potassium resonance doublet (around 0.77 tum), and reveal the 
telltale differential transit depths in and out of the associated lines. 

Based on the study of brown dwarfs, the presence of neutral alkali 
metals in the atmospheres of irradiated exoplanets with similar atmos- 
pheric temperatures (~1,000-1,500 K) was expected, and their detection 
was gratifying. Indeed, there is a qualitative correspondence between 
the atmospheres of close-in and irradiated, or young giant planets (with 
masses of order M,) and older brown dwarfs (with masses of tens of M)). 
Alkalis persist to lower temperatures (~800-1,000 K) and are revealed 
in close-in exoplanet transit and emission spectra, and in older brown- 
dwarf emission spectra because silicon and aluminium, with which 
they would otherwise combine to form feldspars, are sequestered at 
higher temperatures and greater depths into more refractory species, 
and rained out. Had the elements with which sodium and potassium 
would have combined persisted in the atmosphere at altitude, these 
alkalis would have combined and their atomic form would not have 
been detected**. The more refractory silicates (and condensed iron) 
reside in giant exoplanets (and in Jupiter and Saturn), but at great 
depths. In L-dwarf brown dwarfs, they are at the surface, reddening the 
emergent spectra significantly. 

However, the strength, in transiting giant exoplanets, of the contrast 
in and out of these atomic alkali lines is generally less than expected’. 
Subsolar elemental sodium and potassium abundances, ionization by 
stellar light, and hazes have been invoked to explain the diminished 
strength of their associated lines, but the haze hypothesis is gaining 
ground. The definition of a haze can merge with that ofa cloud, but 
generally hazes are clouds of small particulates at altitude that may 
be condensates of trace species or products of photolysis by stellar 
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ultraviolet light and polymerization. They are generally not conden- 
sates of common or abundant molecular species (such as water, ammo- 
nia, iron or silicates, none of which fits the bill here). Although it is not 
at all clear what this haze is, hazes at altitude (<0.01 bars) can provide 
a nearly featureless continuum opacity to light and easily mute atomic 
and molecular line strengths. Indeed, hazes are emerging as central and 
ubiquitous features in exoplanet atmospheres. Annoyingly, not much 
mass is necessary to have an effect on transit spectra, making quantita- 
tive interpretation all the more difficult. The fact that the red colour of 
Jupiter itself is produced by a trace species (perhaps a haze) that so far 
has not been identified is a sobering testament to the difficulties that lie 
ahead in completely determining exoplanet atmospheric compositions. 

The multi-frequency transit measurements of HD 189733b from the 
near-ultraviolet to the mid-infrared by Pont et al.*°” are the clearest 
and most marked indications that some exoplanets have haze lay- 
ers (Fig. 1). Curiously, the measurements show no water or other 
molecular features in transit. Aside from the aforementioned sodium 
and potassium atomic features in the optical, the transit spectrum 
of HD 189733b is consistent with a featureless continuum. Water 
features in a hydrogen (H,) atmosphere are very difficult to com- 
pletely suppress, so their absence is strange. Furthermore, the transit 
radius increases below about 1.0 pm with decreasing wavelength in 
a manner that is reminiscent of Rayleigh scattering. However, owing 
to the large cross-sections implied, the culprit can only be a haze or 
a cloud. It should be mentioned that these transit data cannot distin- 
guish between absorption and scattering, although scattering is the 
more likely cause for most plausible haze materials and particle sizes. 
Scattering is also indicated by the near lack of evidence for absorb- 
ing particulates in HD 189733b secondary eclipse emission spectra”. 
Together, these data suggest that a scattering haze layer at altitude is 
obscuring the otherwise distinctive spectral features of the spectro- 
scopically active atmospheric constituents. 

Transit spectra for the mini-Neptune GJ 1214b have been taken 
by many groups, but the results concerning possible distinguish- 
ing spectral features have, until recently, been quite ambiguous”. In 
principle, there are diagnostic water features at around 1.15 um and 
1.4 um. However, Kreidberg et al.”", using the Wide Field Camera-3 
(WEC3) on the Hubble Space Telescope, have demonstrated that from 
~1.1 to 1.6 um its transit spectrum is around 5-10 times flatter than 
a water-rich, H,-dominated atmosphere with a solar abundance of 
water (oxygen) (Fig. 2). Flatness could indicate that the atmosphere 
has no scale height (see equation 1) (for example, due to a high mean 
molecular weight, 1), or herald the presence, yet again, of a thick haze 
layer obscuring the molecular features. Not surprisingly, a panchro- 
matic obscuring haze layer is currently the front runner. 

Lest one think that hazes completely mask the molecules of exo- 
planet atmospheres, Deming et al.” have published transit spectra of 
HD 209458b (Fig. 3) and XO- 1b that clearly show the water feature at 
around 1.4 um. However, the expected accompanying water feature 
at about 1.15 um is absent. The best interpretation of this is that this 
feature is suppressed by the presence of a haze with a continuum, 
although wavelength-dependent, interaction cross-section that trails 
off at longer wavelengths. The weaker apparent degree of suppression 
in these exoplanet atmospheres might suggest that their hazes are 
thinner or deeper (at higher pressures) than in HD 189733b. Physical 
models explaining this behaviour are lacking. 

So, the only atmospheric species that have clearly been identified 
in transit are water, sodium, potassium and a ‘haze. Molecular hydro- 
gen is the only gas with a low enough u to provide a scale height that 
is great enough to explain the detection in transit of any molecu- 
lar features (see equation 1) in a hot, irradiated atmosphere, and I 
would include it as indirectly indicated. However, carbon monoxide, 
carbon dioxide, ammonia, nitrogen gas, acetylene, ethylene, phos- 
phine, hydrogen sulphide, oxygen, ozone, nitrous oxide and hydro- 
gen cyanide have all been proferred as exoplanet atmosphere gases. 
Clearly, the field is in its spectroscopic infancy. Facilities such as 


REVIEW 


0.159} | 

0.158} + | 
+ 

- 0157+ ®& Fae | 

8 64 | Scale height 

% 0.156+ %, ba cale heig 

= C i. 

3 % + hy | 
0.1554 4 ie ee SMF Niacin ead 

Settling abet Cloud deck 

0.154} | 
0.1 53 1 1 1 pop peg 1 1 1 1 fg pp | 1 


104 ; 108 
Wavelength (A) 

Figure 1 | Transit spectrum of giant exoplanet HD 189733b. The planet/ 
star radius ratio against wavelength in angstréms. The black dots are the data 
points and the dotted lines are models. From left to right the dotted lines show 
the possible effect of Rayleigh scattering by mixed small grains at 2,000 K and 
at 1,300 K, by settling grains and by an opaque cloud deck. The grey line is an 
example spectrum without a haze. Reprinted with permission from ref. 36. 


next-generation ground-based telescopes (extremely large telescopes, 
ELTs) and space-based telescopes such as the James Webb Space Tel- 
escope (JWST)”, or a dedicated exoplanet space-based spectrometer, 
will be essential if transit spectroscopy is to realize its true potential 
for exoplanet atmospheric characterization. The JWST in particular 
will have spectroscopic capability from ~0.6 to ~28.3 um and will 
be sensitive to most of the useful atmospheric features expected in 
giant, Neptune-like and sub-Neptune exoplanets. It may also be able 
to detect and characterize a close-in Earth or super-Earth around a 
nearby small M star. 

There are a number of theoretical challenges that must be met before 
transit data can be converted into reliable knowledge. Such spectra 
probe the terminator region of the planet that separates the day and 
night sides. They sample the transitional region between the hotter 
day and cooler night of the planet, at which the compositions may be 
changing and condensates may be forming. Hence, the compositions 
extracted may not be representative even of the bulk atmosphere. Ide- 
ally, one would want to construct dynamical three-dimensional (3D) 
atmospheric circulation models that couple non-equilibrium chemistry 
and detailed molecular opacity databases with multi-angle 3D radia- 
tion transfer. Given the emergence of hazes and clouds as potentially 
important features of exoplanet atmospheres, a meteorologically cred- 
ible condensate model is also desired. We are far from the latter’, and 
the former’s capabilities are only now being constructed, with limited 
success“. The dependence of transit spectra on species abundance is 
weak, making it now difficult to derive mixing ratios from transit spec- 
tra to better than a factor of 10 to 100. Although the magnitude of the 
variation of apparent radius with wavelength depends on atmospheric 
scale height, and hence temperature, the temperature—pressure profile 
and the variation of abundance with altitude are not easily constrained. 
To obtain even zeroth-order information, one frequently creates isother- 
mal atmospheres with chemical equilibrium or uniform composition. 
Current haze models are ad hoc, and adjusted a posteriori to fit the 
all-too-sparse and at times ambiguous data. To justify the effort neces- 
sary to do better will require much improved and higher-resolution 
measured spectra’. 

Data at secondary eclipse require a similar modelling effort, but 
probe the integrated flux of the entire dayside. Hence, a model that 
correctly incorporates the effects of stellar irradiation (‘instellatiom) 
and limb effects is necessary. Moreover, the flux from the cooling plan- 
etary core, its longitudinal and latitudinal variation, and a circulation 
model that redistributes energy and composition are needed. Most 
models employed so far use a representative one-dimensional (planar) 
approximation, and radiative and chemical equilibrium for what is a 
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Figure 2 | The transmission spectrum of GJ 1214b. The relative depth of the transit of the sub-Neptune GJ 1214b against wavelength (1.1 um to 1.7 um). The 
coloured lines in both (a) and (b) are various transit spectral models without a haze (cloud-free solar composition, orange; 100% water, blue; methane, green; 
and carbon dioxide, red). The data (black and grey dots) are effectively flat, ruling out all models shown and suggesting a veiling haze. 


hemispherical region that might be out of chemical equilibrium (and 
slightly out of radiative equilibrium). The emission spectra of the day- 
side depend more on the absorptive opacities, whereas transit spectra 
depend on both scattering and absorption opacities. Hence, if the haze 
inferred in some transit spectra is due predominantly to scattering, its 
effect on secondary eclipse spectra will be minimal, making it slightly 
more difficult to use insight gained from one to inform the modelling 
of the other. 

Many giant exoplanets, and a few sub-Neptunes, have been observed 
at secondary eclipse, but the vast bulk of these data are comprised ofa 
few photometric points per planet. The lion’s share have been garnered 
using Spitzer, Hubble or large-aperture ground-based telescopes, and 
pioneering attempts to inaugurate this science were carried out by 
Deming et al.*° and Charbonneau et al.**. Photometry, particularly if 
derived using techniques that are subject to systematic errors, is ill- 
suited to delivering solid information on composition, thermal pro- 
files or atmospheric dynamics. The most one can do with photometry 
at secondary eclipse is to determine rough average emission tempera- 
tures, and perhaps reflection albedos in the optical. Temperatures for 
close-in giant exoplanet atmospheres from around 1,000 to 3,000 K 
have, in this way, been determined. Of course, the mere detection of 
an exoplanet is a victory, and the efforts that have gone into winning 
these data should not be discounted. Nevertheless, with nearly 50 such 
campaigns and detections ‘in the can, we have learned that it is only 
with next-generation spectra that use improved (perhaps dedicated) 
spectroscopic capabilities that the desired thermal and compositional 
information will be forthcoming. 

One of the few reliable compositional determinations at secondary 
eclipse obtained so far is for the dayside atmosphere of HD 189733b 
using the now-defunct infrared spectrograph on-board Spitzer™. This 
very-low-resolution spectrum nevertheless provided a 30 detection 
of water at around 6.2 um. Other papers have reported the detec- 
tion of molecules at secondary eclipse, but many are less compelling, 
and earlier reports of water being detected using photometry alone 
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at secondary eclipse are very model-dependent™. It is only with well- 
calibrated spectra that one can determine with confidence the pres- 
ence in any exoplanet atmosphere of any molecule or atom. 


Winds from planets 

The existence of what are now somewhat contradictorily called hot Jupi- 
ters has, since the discovery of 51 Peg b in 1995, been somewhat of a 
puzzle. These planets probably cannot form as close as they are observed 
to their parent star and must migrate in, by some process, from beyond 
the so-called ice line. In such cold regions, ices can form and accumulate 
to nucleate gas-giant formation. Subsequent inward migration could be 
driven early in the planet’s life by gravitational torquing by the proto- 
planetary disk or by planet-planet scattering, followed by tidal dissipa- 
tion in the planet (which circularizes its orbit). However, once parked at 
between ~0.01 au and 0.1 Au from the star, how does the gaseous planet, 
or a gaseous atmosphere of a smaller planet, survive evaporation by the 
star’s intense irradiation during perhaps billions of years seemingly in 
extremis? The answer is that for sub- Neptunes and rocky planets their 
atmospheres or gaseous envelopes might not survive, but for more mas- 
sive gas giants the gravitational well at their surfaces may be sufficiently 
deep. Nevertheless, since the first discoveries, evaporation has been of 
interest. The atmospheres of Earth and Jupiter are known to be evapo- 
rating, although at a very low rate. But what happens to a hot Jupiter that 
experiences 10° times the instellation that Jupiter does? 

The answer came with the detection by Vidal-Madjar et al.” of a 
wind from HD 209458b. Using the transit method, but in the ultravio- 
let around the Lyman-a line of atomic hydrogen at around 0.12 um, 
the authors measured a transit depth of about 15%. Such a large depth 
implies a planet radius greater than four Rj, which is not only much 
greater than what is inferred in the optical, but beyond the tidal Roche 
radius. Matter at such distances is not bound to the planet, and the only 
plausible explanation was that a wind was being blown off the planet. 
The absorption cross-sections in the ultraviolet are huge, so the matter 
densities that are necessary to generate a transverse chord optical depth 
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of one are very low — too low to affect the optical and infrared measure- 
ments. The upshot of this is the presence of a quasi-steady planetary 
wind with a mass-loss rate of 10'°-10" gm s |. Atthat rate, HD 209458b 
will lose no more than around 10% of its mass in Hubble time. 

Since this initial discovery, winds from the hot Jupiters HD 189733b”’ 
and WASP-12b*, and from the hot Neptune GJ 436b™ have been dis- 
covered by the ultraviolet transit method and partially characterized. In 
all cases, the telltale indicator was in atomic hydrogen. Mass-loss rates 
have been estimated™, and in the case of WASP-12b might be sufficient 
to completely evaporate the giant within as little as about 1 gigayear. The 
presence of atomic hydrogen implies the photolytic or thermal break-up 
of molecular hydrogen, so these data simultaneously suggest the pres- 
ence of both H and H,, Linsky et al.** detected ionized carbon and silicon 
in HD 209458b’s wind, and Fossati et al.°! detected ionized magnesium 
in WASP-12b’s wind, but the interpretation of the various ionized spe- 
cies detected in these transit-observation campaigns is ongoing. 

The theoretical challenges posed by planetary winds revolve partly 
around the driver. Is the wind driven by the subset of the instellation 
represented by the ultraviolet and X-ray component of the total stellar 
flux? In addition, in the rotating system of the orbiting planet, what 
ingress or egress asymmetries in the morphology of the wind exist? 
There are indications that Coriolis forces on planet winds are indeed 
shifting the times of ingress and egress. What is the effect of planet-star 
wind interactions? There are suggestions of Doppler shifts in lines of 
the ultraviolet transit data that arise from planet-wind speeds, but how 
can we be sure? How is the material for the wind replenished from the 
planet atmosphere and interior? And finally, what is the correspondence 
between the ultraviolet photolytic chemistry in the upper reaches of the 
atmosphere that modifies its composition there and wind dynamics? 
This is a rich subject tied to many subfields of science, and is one of the 
important topics to emerge from transit spectroscopy. 


Phase light curves and planet maps 

Asa planet traverses its orbit, its brightness, as measured at Earth at a 
given wavelength, varies with orbital phase. A phase light curve com- 
prises both a reflected component that is a stiff function of the star- 
planet-Earth angle and is most prominent in the optical and ultraviolet; 
and a thermal component that more directly depends on the tempera- 
ture and composition of the planet’s atmosphere, and their longitudinal 
variation around the planet, and is most prominent in the near- and 
mid-infrared. Hence, a phase light curve is sensitive to the day-night 
contrast and is a useful probe of planetary atmospheres”. It should be 
mentioned that the planet/star contrast ratio is largest for large exoplan- 
ets in the closest orbits, so hot Jupiters currently provide the best targets. 

In the optical, there has been some work to derive the albedo”, 
or reflectivity, of close-in exoplanets, which is largest when there are 
reflecting clouds and smallest when the atmosphere is absorbing. In 
the latter case, thermal emission at high atmospheric temperatures can 
be mistaken for reflection, so detailed modelling is required. In any 
case, Kepler, with its superb photometric sensitivity, has been used to 
determine optical phase curves” of a few exo-giants in the Kepler field, 
and the MOST (Microvariability and Oscillations of Stars) microsatel- 
lite has put a low upper limit on the optical albedo of HD 209458b"”, 
but much remains to be done to extract diagnostic optical phase curves 
and albedos for exoplanets. 

Interesting progress has been made, however, in the thermal infrared. 
Using Spitzer at 8 pm, Knutson et al.” not only derived a phase light 
curve for HD 189733b, but derived a crude thermal map of its surface. 
By assuming that the thermal emission pattern over the planet surface 
was fixed during the observations, they derived the day-night bright- 
ness contrast (translated into a brightness temperature at 8 um) anda 
longitudinal brightness temperature distribution. In particular, they 
measured the position of the ‘hot spot’ If the planet is in synchronous 
rotation (spin period is the same as the orbital period), and there are 
no equatorial winds to advect heat around the planet, one would expect 
the hot spot to be at the substellar point. The light curve would phase 
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up with the orbit and the peak brightness would occur at the centre 
of secondary eclipse. However, what the authors observed was a shift 
downwind to the east by around 16° + 6. The most straightforward 
interpretation is that the stellar heat absorbed by the planet is advected 
downstream before being re-radiated by super-rotational flows such as 
those that are observed on Jupiter itself. Moreover, these data indicate 
that, because the measured day-night brightness temperature contrast 
was only about 240 K, the zonal wind flows driven by stellar irradiation 
carry heat to the night side, where it is radiated at a detectable level. 
Hence, these data point to the existence of atmospheric dynamics on 
HD 189733b, qualitatively (although not quantitatively) in line with 
theoretical expectations™. 

For HD 189733b, this work has been followed up using Spitzer at 
3.6 um and 4.5 um™ and, in a competing effort, a more refined map has 
been produced®. Infrared phase curves for the giants HD 149026b®, 
HAT-P-2b” and WASP-12b®, among other exoplanets, have been 
obtained. However, one of the most intriguing phase curves was 
obtained by Crossfield et al.” using Spitzer at 24 um for the non-trans- 
iting planet v Andromedae b (Fig. 4). The authors found a huge phase 
offset of around 80°, for which a cogent explanation is still lacking. The 
closeness of this planet to Earth could partly compensate for the fact 
that it is not transiting to allow sufficient photometric accuracy without 
eclipse calibration, yielding one of the few non-transiting light curves. 
All these efforts collectively demonstrate the multiple, at times unan- 
ticipated and creative, methods being employed by observers seeking 
to squeeze whatever information they can from exoplanets. 

Theoretical models for light curves have been sophisticated, but 
theory and measurement have not yet meshed well. Both need to be 
improved. First, models need to be improved in terms of their treat- 
ment of hazes and clouds that could reside in exoplanet atmospheres 
and will boost reflection albedos significantly; second, they need to 
incorporate polarization to realize its diagnostic potential”””; third, 
they should constrain the possible range of phase functions to aid in 
retrievals; fourth, they need to embed the effects of variations in planet 
latitude and longitude in the analysis protocols; fifth, they should pro- 
vide observational diagnostics with which to probe atmospheric pres- 
sure depths, particularly using multi-frequency data; sixth, they should 
be constructed as a function of orbital eccentricity, semi-major axis, 
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Figure 3 | Transit depth spectrum of the hot Jupiter HD 209458b. Data 
points are shown as black circles and open squares/diamonds. The presence 
of water is demonstrated by the occurrence of a feature at 1.4 jum, but the 
corresponding ~1.15-um feature is absent. The best explanation is that 

the latter is suppressed by haze scattering. Not obvious here is the fact that 
even the 1.4-m feature is muted with respect to non-haze models. The two 
coloured curves are representative model spectra with different levels of haze. 
Reprinted with permission from ref. 42. 
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Figure 4| The measured light curve of v Andromedae b. The thin black dotted curve is the authors’ best fit to the data points (black dots with error bars at a 
wavelength of 24 um), showing a phase offset of ~80° (22% ofa circuit). Reprinted with permission from ref. 69. 


and inclination; and finally they should span the wide range of masses 
and compositions that the heterogeneous class of exoplanets is likely 
to occupy. Accurate spectral data with good time coverage from the 
optical to the mid-infrared could be game-changing, but theory needs 
to be ready with useful physical diagnostics. 


High spectral resolution techniques 

The intrinsic dimness of planets under the glare of stars renders high- 
resolution, panchromatic spectral measurements difficult, if desirable. 
However, ultra-high spectral resolution measurements using large- 
aperture ground-based telescopes, but over a very narrow spectral 
range and targeting molecular band features in a planet’s atmosphere 
that are otherwise jumbled together at lower resolutions, has recently 
been demonstrated. Snellen et al.’' have detected the Doppler variation 
owing to HD 209458b’s orbital motion of carbon monoxide features near 
~2.3 um. The required spectral resolution (A/AA) was about 10° and 
the plant’s projected radial velocity just before and just after primary 
transit changed from +15 kms 'to-15 kms ‘. This is consistent with 
the expected circular orbital speed of around 140 km s"‘ and provides 


Figure 5 | Surface map of brown dwarf Luhman 16B. These maps are 
obtained by Doppler imaging and depict different epochs during the rotation 
of Luhman 16B. Large-scale cloud inhomogeneities are suggested by the dark 
patches at 2.4 hours and 3.2 hours. The rotation period of the brown dwarf is 
4.9 hours. Reprinted with permission from ref. 76. 
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an unambiguous detection of carbon monoxide. Furthermore, this team 
attempted to measure the zonal wind speeds of air around the planet, 
estimated theoretically to be near ~1 kms ', thereby demonstrating 
the potential of such a novel technique to extract weather features on 
giant exoplanets. The same basic method has been applied near primary 
transit to detect carbon monoxide” and water” in HD 189733b. Carbon 
monoxide can be detected in Jupiter and was thermochemically pre- 
dicted to exist in abundance in the atmospheres of hot Jupiters”, but its 
actual detection by this method is impressive. 

In fact, the same technique has been successfully applied in the car- 
bon monoxide band to the non-transiting planet t Bodtis b” and for the 
wide-separation giant planet (or brown dwarf) B Pictoris b”, verifying 
the presence of carbon monoxide in both their atmospheres. Finally, 
using a related technique Crossfield et al.”* have been able to conduct 
high-resolution ‘Doppler imaging’ of the closest known brown dwarf 
(Luhman 16B). By assuming that the brown dwarf’s surface features 
are frozen during the observations and that it is in solid-body rotation, 
and by dividing the surface into a grid in latitude and longitude, they 
were able to determine (by model fitting) surface brightness variations 
from the variations of its flux and Doppler-shift time series. By this 
means, they mapped surface spotting that may reflect broken cloud 
structures (Fig. 5). 

In support of such measurements, theory needs to refine its modelling 
of planet surfaces, zonal flows and weather features, 3D heat redistribu- 
tion and velocity fields, and temporal variability. Currently, most 3D 
general circulation models do not properly treat high Mach number 
flows, but they predict zonal wind Mach numbers of order unity. There 
are suggestions that magnetic fields affect the wind dynamics and heat- 
ing in the atmosphere, but self-consistent multi-dimensional radiation 
magnetohydrodynamic models have not yet been constructed. 

This series of measurements of giant exoplanets and brown dwarfs 
using high-resolution spectroscopy focused on narrow molecular fea- 
tures emphasizes two important aspects of exoplanet research. The 
first is that observers can be clever and develop methods unanticipated 
in roadmap documents and decadal surveys. The second is that with 
the next-generation of ground-based ELTs equipped with impressive 
spectrometers, astronomers may be able to measure and map some 
exoplanets without using the high-contrast imaging techniques that 
are now emerging to compete. 


High-contrast imaging 

Before the successful emergence of radial-velocity and transit methods, 
astronomers expected that high-contrast direct imaging that separates 
out the light of the planet and of the star, and provides photometric 
and spectroscopic data for each, would be the leading means of exo- 
planet discovery and characterization. A few wide-separation brown 
dwarfs and/or super-Jupiter planets were detected by this means, but 
the yield was meagre. The fundamental problem is twofold: the planets 
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are intrinsically dim, and it is difficult to separate out the light of the 
planet from under the glare of the star for planet-star separations like 
those of the Solar System. Imaging systems need to suppress the stellar 
light scattered in the optics that would otherwise swamp the planet’s 
signature. The planet/star contrast ratio for Jupiter is ~10-” in the optical 
and ~10°’ in the mid-infrared. For Earth, the corresponding numbers 
are ~10°!°and ~10°°. These numbers are age, mass, orbital distance and 
star dependent, but demonstrate the challenge. Furthermore, contrast 
capabilities are functions of planet-star angular separation, restricting 
the orbital space that is accessible. 

However, high-contrast imaging is finally emerging to complement 
other methods. It is most sensitive to wider-separation (~10-200 av), 
younger giant exoplanets (and brown dwarfs), but technologies are 
coming online with which to detect older and less massive exoplanets 
down to around 1.0 Au separations for nearby stars (closer than ~10 par- 
secs)'”!*’’, Super-Neptunes around M dwarfs might soon be within 
reach. Using direct imaging, Marois et al.”*” have detected four giant 
planets orbiting the A star HR 8799 (HR 8799b, HR 8799c, HR 8799d 
and HR 8799e) and Lagrange et al.*’ have detected a planet around 
the A star B Pictoris. The contrast ratios in the near infrared are about 
10°*, but capabilities near 10° have been achieved and performance 
near 10°’ is soon anticipated’*"’. One of the results to emerge from the 
measurements of both the HR 8799 and 6 Pictoris planets is that to fit 
their photometry in the near-infrared from ~1.0 to 3.0 jum, thick clouds, 
even thicker than those seen in L-dwarf brown dwarf atmospheres, are 
necessary”. This re-emphasizes the theme that the study of hazes and 
clouds (nephelometry) has emerged as a core topic in exoplanet studies. 

One of the most exciting recent measurements through direct imag- 
ing was by Konopacky et al.” of HR 8799c. Using the Ohio State Infra- 
red Imager/Spectrometer (OSIRIS) on the 10-metre Keck II telescope, 
they obtained unambiguous detections between ~1.95 um and 2.4 um 
of both water and carbon monoxide in its ~1,000 K atmosphere. This 
\/AX = 4,000 spectrum is one of the best obtained so far, but was enabled 
by the youth (around 30 million years), wide-angular separation and 
large mass (~5-10 M,) of the planet. 

Improvements in theory that are needed to support direct imaging 
campaigns mirror those needed for light curves, but are augmented 
to include planet-evolution modelling to account for age, metallicity 
or composition and mass variations. Most high-contrast instruments 
are focused on the near-infrared, so cloud physics and near-infrared 
line lists for likely atmospheric constituents will require further work. 
The reader will note again that most observations and measurements 
of exoplanet atmospheres have been for giants. There are a few for 
sub-Neptunes and super-Earths, but high-contrast measurements of 
Earths around G stars like the Sun are not likely in the near future’. 
The planet/star contrast ratios are just too low, although Earths around 
Mstars might be within reach — if we get lucky. For now, giants and 
Neptunes are the focus, as astronomers hone their skills for an even 
more challenging future. 


What we know about atmospheric compositions 
The species we have, without ambiguity, discovered so far in exoplanet 
atmospheres are: water, carbon monoxide, sodium, potassium and 
hydrogen (H,), with various ionized metals indicated in exoplanet 
winds. Expected, but as yet undetected, species include: ammonia, 
methane, nitrogen gas, carbon dioxide, hydrogen sulphide, phosphine, 
hydrogen cyanide, acetylene, ethylene, oxygen, ozone and nitrous oxide. 
The nature of the hazes and clouds inferred is as yet unknown. The 
atmospheres probed have temperatures from ~600 K to ~3,000 K. Good 
spectra are the essential requirements for unambiguous detection and 
identification of molecules in exoplanet atmospheres, and these have 
been rare. Determining abundances is also difficult, because to do so 
requires not only good spectra, but also reliable models. Errors in abun- 
dance retrievals of more than an order of magnitude are likely, and this 
fact has limited the discussion of abundances in this paper. 
Nevertheless, with the construction of ground-based ELTs, the 
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various campaigns of direct imaging’, the launch of the JWST, the 
possible launch of the 2.4 m Wide-Field Infrared Survey Telescope 
(WFIRST)-AFTA”, the various ongoing campaigns with Hubble and 
Spitzer, and with extant ground-based facilities, the near-term future 
of exoplanet atmospheric characterization promises to be even more 
exciting than its past. m 
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The role of space telescopes in the 
characterization of transiting exoplanets 


Artie P. Hatzes' 


Characterization studies now have a dominant role in the field of exoplanets. Such studies include the measurement of an 
exoplanet’s bulk density, its brightness temperature and the chemical composition of its atmosphere. The use of space 
telescopes has played a key part in the characterization of transiting exoplanets. These facilities offer astronomers data of 
exquisite precision and temporal sampling as well as access to wavelength regions of the electromagnetic spectrum that 
are inaccessible from the ground. Space missions such as the Hubble Space Telescope, Microvariability and Oscillations 
of Stars (MOST), Spitzer Space Telescope, Convection, Rotation and Planetary Transits (CoRoT), and Kepler have rapidly 
advanced our knowledge of the physical properties of exoplanets and have blazed a trail for a series of future space mis- 
sions that will help us to understand the observed diversity of exoplanets. 


from a field that focused primarily on discoveries to one in 

which their characterization has taken central stage. These 
characterization studies include measurements of the mass, radius and 
bulk density of the planet, as well as the albedo, brightness temperature 
and composition of the atmosphere. The drivers of such studies are 
transiting exoplanets — planets whose orbital inclination with the line- 
of-sight is such that the planet periodically crosses in front of the star, 
resulting in a slight dip in its brightness. 

The transit yields important characteristics about the exoplanet. From 
the dip in the stellar brightness during the transit, astronomers can 
derive the radius of the planet. When combined with the mass deter- 
mination from spectroscopic Doppler measurements, this can be used 
to obtain the planet's mean density. The density gives us the first hints 
about the internal structure of the planet. 

It is also possible to characterize the atmosphere of a transiting exo- 
planet using measurements taken at different points in its orbit. These 
measurements include in-transit spectra, phase variations of the exo- 
planet as it orbits the star and exoplanet occultations. 

During the transit, a fraction of the starlight is absorbed by the plan- 
etary atmosphere. The planet’s atmosphere thus imprints itself on the 
stellar spectrum giving us a glimpse of its composition. These kinds of 
measurements are called in-transit spectroscopy. 

As the planet orbits, the illuminated portion of the sphere as seen 
from Earth changes, resulting in light variations — called a brightness 
phase curve — during the orbit. Measuring the phase curve enables 
astronomers to map the brightness distribution on the planet. High- 
precision photometric measurements can also detect other subtle vari- 
ations in the light curve of the star. One effect is the so-called ellipsoidal 
variation that results from the distortion of the star due to tides raised by 
the planet. Another is the relativistic beaming effect, resulting from the 
star’s reflex motion about the centre of mass of the system. Both of these 
phenomena provide independent measurements of the planet mass. 

Finally, when the planet disappears behind the star, its reflected and 
radiated light is blocked. Phase and occultation measurements give us 
information about the brightness temperature, the albedo (the fraction of 
the incoming light that is reflected) and the exoplanet’s spectral features. 

Space-based instruments offer us the best means of characterizing 
transiting exoplanets. The superb temporal sampling (nearly continuous 


() ver the past decade the study of exoplanets has rapidly evolved 


observations from months to years) allows us to extend the parameter 
space of transiting exoplanets to those with much longer orbital periods 
than can be discovered with ground-based telescopes. The exquisite 
photometric precision also allows astronomers to detect smaller planets 
than they could from the ground — exoplanets whose radii are compa- 
rable with that of the Earth, or even smaller. These high-precision meas- 
urements from space also make it easier to detect the minute signatures 
(less than 10~) of the reflected and radiated light from the exoplanet, as 
well as its atmospheric features. Finally, space instruments can access 
important wavelength regions, such as the infrared, that are simply not 
accessible from the ground. 

Several space missions have contributed considerably to the discov- 
ery, study and characterization of transiting exoplanets. These include 
the Convection, Rotation and Planetary Transits (CoRoT) satellite, 
Kepler, the Microvariability and Oscillations of Stars (MOST) space 
telescope, the Hubble Space Telescope and the Spitzer Space Telescope. 
(See Review by Lissauer on page 336 for the contributions from the 
Kepler mission.) 


CoRoT mission 

CoRoT was the first spacecraft devoted to the discovery of transiting 
planets as well as the asteroseismic study of stars’. This mission was led 
by the French space agency Centre National d’Etudes Spatiales (CNES) 
along with contributions from the European Space Agency (ESA), Aus- 
tria, Belgium, Brazil, Germany and Spain. Launched in December 2006, 
the 27-cm CoRoT telescope set the stage for NASAs larger and more 
ambitious Kepler mission. Over its lifetime, CoRoT observed more than 
20 different star fields for up to a maximum of 150 days, collecting 
photometric data on approximately 175,000 stars. 

CoRoT light curves are still being analysed for the signals of plan- 
ets, but to date the mission has discovered 32 confirmed exoplanets, 
mostly gaseous giant planets (Fig. 1). For over two orders of magnitude 
in planet mass (7m ~ 0.3 — 20 M,) these planets follow a tight mass—den- 
sity power law: 


e(gm cm”) = (0.73 +0.1 yee) 


where M, is the mass of Jupiter. This nearly linear relationship of 
density with mass reflects the fact that all objects with masses ranging 
from giant planets (~1 M,) to low-mass stars (~100 M,) have about the 
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Figure 1 | Mass-density diagram for the CoRoT giant planets. Of the 

32 exoplanets that CoRoT has discovered 26 are giant planets; the outliers 
CoRoT-8b, CoRoT-13b and CoRoT-20b are highlighted. The line represents 
a power law of the form p(gm cm *) = (0.73 + 0.1) Mj!7*°!), where M,is the 
mass of Jupiter. 


same radius. Thus, an increase in mass is accompanied by a propor- 
tional increase in density. This is expected as the pressure support in 
giant planets is provided by the electron-degeneracy pressure. Stars, 
however, have their pressure support provided by hydrogen burning 
under hydrostatic equilibrium. 

There are two outliers that deviate from this power law relationship by 
almost 40: CoRoT-13b and CoRoT-20b. (The outlier at low planet mass 
CoRoT-8b is outside the mass range of the computed fit.) These planets 
have anomalous densities for their respective sizes. One hypothesis is 
that these may be the merger of two more ‘normal giant planets’. 

The CoRoT mission has contributed to exoplanet studies in a number 
of key areas. 


Transiting planets around active stars 

One of CoRoT’s first exoplanet discoveries was CoRoT-2b’. This planet 
orbits a very active star, the surface of which is covered with cool spots. 
These spots produce photometric variability with an amplitude of about 
2% that can mask the photometric dip caused by the transit events. 
CoRoT-2 is the most active star known to host a transiting exoplanet. 
Such a discovery would have been very difficult with ground-based 
measurements, and immediately demonstrated the value of those made 
from space. Planets around active stars are important for studying the 
interaction between the magnetic fields of the star and planet’. 


Cool giant planets 

Before the launch of Kepler, CoRoT’s planets accounted for half of trans- 
iting systems with orbital periods greater than 8 days. Noteworthy is 
CoRoT-9b — a planet in a 95-day orbit and the first transiting giant 
planet discovered with a moderate temperature’. Although most hot 
Jupiters have anomalously large radii that are typically 30-50% larger 
than that of Jupiter (R,), CoRoT-9b has a quite normal radius (1.01 R,). 
This supports the hypothesis that the anomalous radii of close-in giant 
planets are somehow associated with the high radiation flux coming 
from the host star®°. CoRoT-9, with a brightness temperature of about 
300 kelvin (K), has a ‘balmier’ temperature compared with other close- 
in transiting giant planets. 


Oases in the brown-dwarf desert 

Brown dwarfs are objects with masses (~13-60 M,) that lie between giant 
planets and stars that are fusing hydrogen in their cores. Like planets, 
brown dwarfs undergo no core nuclear burning, but they do go through 
a short phase of deuterium burning. Ground-based Doppler measure- 
ments of stars have established that there is a paucity of such objects in 
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relatively short-period orbits, the ‘brown-dwarf desert. CoRoT discov- 
ered at least three objects in this desert. In particular, CoRoT-3b with a 
mass of 22 M, and the same radius as Jupiter’ was the first brown dwarf 
to be characterized in terms of its mass and radius. This object may help 
our understanding of the relationship between giant planets and brown 
dwarfs and their respective formation scenarios. 


Arocky super-Earth 

The jewel in the crown of the CoRoT discoveries was the detection and 
characterization of the first transiting rocky planet, CoRoT-7b. The host 
star shows transit events every 20.5 hours with a photometric depth of 
a mere 0.03% caused by a planet with a radius 1.6 that of Earth (R,)*. 
Ground-based spectroscopic Doppler measurements determined the 
mass of CoRoT-7b as 7.42 + 1.21 M, (where M,is the mass of Earth)’. 
This corresponds to a density, p, of 10.4 + 1.8 gcm”™, possibly making 
the structure of CoRoT-7b closer to that of Mercury than that of Earth. 
CoRoT-7b gave us the first indication that rocky planets can have ultra- 
short-period orbits (orbital periods of less than 1 day). Shortly after its 
launch, Kepler discovered a virtual twin to CoORoT-7b — Kepler-10b”°. 
This exoplanet has nearly the same orbital period (0.84 days) as 
CoRoT-7b, is slightly smaller (1.4 R,), less massive (4.6 M,), but has 
comparable bulk density (8.8 g cm”). Currently, the short-period 
rocky planet record holder is Kepler-78b, an Earth-sized planet that 
races around its host star in a mere 8.5 hours'’. Recent ground-based 
Doppler measurements were able to confirm this as a rocky planet'*”? 
that has an Earth-like density of 5.5 gmcm”. 

Although CoRoT and Kepler were both devoted to the detection of 
planets by the transit method, the two missions were complementary 
in several ways. First, the target selection of Kepler was geared pre- 
dominantly to stars like our Sun, whereas CoRoT obtained data on 
most stars in its field of view. As a result, CoRoT targets included a 
significant fraction of stars that were not Sun-like. About 35% of the 
CoRoT exoplanet discoveries orbit F-type stars, which are hotter and 
more massive than the Sun. By contrast, there is a 20% occurrence rate 
for planets around such stars found by Kepler. This may reflect the fact 
that CoRoT included more F-type stars as targets. 

Second, the two missions searched for exoplanets in different direc- 
tions in our Galaxy. Kepler looked at a field near the constellation of 
Cygnus, just off the plane of the Milky Way. Whereas, CoRoT looked 
at two regions in the sky (the ‘eyes of CoRoT”) in the galactic plane, one 
towards the centre of our galaxy and another towards the anti-centre. 
The galactic plane contains a larger number of young stars. Conse- 
quently, several of CoRoT’s discoveries have been found around rela- 
tively young host stars. One of its most recent discoveries, CoRoT-32b 
(D. Gandolfi et al., manuscript in preparation), which is about 25 mil- 
lion years old, might be one of the youngest discovered giant planets. 
Comparing the results of CoRoT and Kepler might help us to under- 
stand the role of the galactic environment in planet formation. 


MOST space telescope 

MOST is Canada’ first space telescope and it was funded by the Cana- 
dian Space Agency, with ground-based and scientific support from Aus- 
tria. It is a 15-cm-diameter telescope (often called “The Humble Space 
Telescope’) that was launched in 2003. Although designed to study stel- 
lar oscillations, it also delivered high-quality light curves on transiting 
exoplanets. MOST was the first space telescope to attempt to measure 
the optical albedo of an exoplanet and placed a low upper limit (less 
than 0.13) on the albedo of the transiting planet HD 209458b™. It also 
detected the transit of 55 Cancrie’, a planet with nearly twice the diam- 
eter, 8.3 M,, and with an ultra-short orbital period of only 0.74 days. 
The exoplanet 55 Cnc e was first discovered by ground-based Doppler 
surveys, but, with a photometric depth of only 0.04%, the transit would 
have been extremely difficult to detect with ground-based telescopes. 
Measurements from space were needed to show that this was an impor- 
tant transiting exoplanet. Ten years after its launch MOST continues to 
produce important results on transiting exoplanets. 
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It is worth mentioning that MOST, CoRoT and Kepler exemplify a 
strong spirit of international collaboration that is not often found in a 
highly competitive field such as exoplanet research. MOST data were 
provided to the CoRoT team to test on-board photometric reduction 
methods. Likewise, the CoRoT mission provided the Kepler team with 
early access to light curves so that they might better understand the 
influence of stellar activity. Each mission played an important part in 
the success of the subsequent one. 


Hubble Space Telescope 

Hubble has been a pioneering telescope for the study of exoplanetary 
atmospheres, with its suite of instruments such as the Space Telescope 
Imaging Spectrograph (STIS), the Near Infrared Camera and Multi- 
Object Spectrometer (NICMOS), the Advanced Camera for Surveys 
(ACS) and the Cosmic Origins Spectrograph (COS). Hubble was 
the first space-based facility to be employed in the study of exoplan- 
ets. It obtained, using STIS, what is arguably the finest light curve 
of an exoplanet transit, in this case the first known transiting planet 
HD 209458b"*. This transit measurement established that space-based 
photometry could achieve the photometric precision needed to detect 
Earth-sized planets. It is possible that CoRoT and Kepler had an eas- 
ier path towards approval owing to the pioneering measurements of 
Hubble. This telescope was also the first to detect an atomic species in an 
exoplanet atmosphere, namely sodium in HD 209458b””. Subsequently, 
in-transit spectroscopy with Hubble also detected the atomic species 
of hydrogen’* and magnesium” in this exoplanet. Molecular oxygen 
may have also been found in HD 189733b” — another bright, nearby 
transiting exoplanet — using Hubble archive data. 

We expected the atmospheres of giant planets to have molecules such 
as carbon monoxide, carbon dioxide, water and methane, much like 
the giant planets in our solar system. These features are best detected 
at infrared wavelengths. There have been reports of the detection of 
these molecules in the atmosphere of HD 209458b” and HD 189733b” 
using Hubble. 

The detection of these molecules in the atmospheres of exoplanets, 
however, is controversial as the results might depend on the subtle 
details of how the data are reduced. For example, the resulting in- 
transit (transmission) spectra for several transiting exoplanets can be 
markedly altered when using different instrumental models to account 
for systematic errors”. Thus, the robustness of the detection of water, 
carbon dioxide and methane comes into question. This only under- 
lines the difficulty, even when using instruments in space, of detecting 
molecular species in the atmospheres of exoplanets. The exoplanet field 
is truly ‘pushing the envelope’ when it comes to the detection of weak 
signals in data. We not only have to understand our instruments better, 
but also the star itself. For example, the presence of stellar spots and a 
poor knowledge of the physics of the stellar atmosphere will also affect 
our ability to detect atmospheric features of exoplanets™. The healthy 
discussion in the literature is a good example of how science is done: 
re-analyses of data and independent measurements to confirm sensa- 
tional detections. 

Hubble has also revealed that exoplanet atmospheres show evidence 
of Rayleigh scattering due to atmospheric haze. The transmission spec- 
tra of HD 189733b in the spectral range 0.550-1.050 jum obtained with 
the ACS of Hubble showed no indication of the expected sodium or 
potassium features, but rather a featureless slope that was consistent 
with Rayleigh scattering from clouds or haze in the upper atmosphere 
of the planet”. 

One of the more surprising results in the characterization of exoplanet 
atmospheres is that featureless atmospheric spectra may be common 
around Neptune-mass planets. The transmission spectrum of the trans- 
iting super-Earth GJ 1214b in the wavelength range 1-1.7 um taken 
with the Wide Field Camera-3 of Hubble” is flat and lacks any obvious 
spectral features. The observed spectrum rules out a hydrogen-rich 
atmosphere and is consistent with an atmosphere abundant in water 
vapour, or with a considerable amount of clouds. A similar result was 
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also obtained for the transiting GJ 436b Neptune-mass planet”. The 
detection of atmospheric absorption features in Neptune-like and 
super-Earth planets will be challenging and may require spectral obser- 
vations taken at higher spectral resolution. 


Spitzer Space Telescope 
Spitzer is an infrared facility that was launched in 2009. It was the last 
of NASAs Great Observatories programme. This 0.85-m-diameter tel- 
escope operates at the long wavelengths of 3 um to 180 pm. Planets 
orbiting close-in to their host stars have high equilibrium temperatures 
and thus most of their radiated light is at infrared wavelengths. For 
exoplanet studies, Spitzer has a marked advantage over Hubble, which 
operates mostly at optical and near infrared wavelengths. 
Measurements were made of the transiting planet HD 189733b~* 
using the 8-ym channel of the Infrared Array Camera (IRAC) on board 
Spitzer. These have produced a textbook example of the primary transit, 
the planet occultation and the variations due to the changing phases of 
the planet as it orbits the star (Fig. 2). The phase curve was used to map 
the temperature distribution on the exoplanet surface. This yielded a 
day and night temperature of 1,212 K and 973 K, respectively. Surpris- 
ingly, the map shows that the peak temperature for the planet is shifted 
by 16° from the sub-stellar point, the point on the planet directly under 
the star. The planet rotation is tidally locked to the orbital period so 
that the planet shows the same face to the star. Thus, one expects the 
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Figure 2 | Spitzer light curve for HD 189733. a, The observed phase 
variations with the transit and planet occultation visible. b, An expanded view 
to better show the phase variations. c, Brightness distribution of the surface of 
HD 189733b derived from the light curve. Figure reprinted with permission 
from ref. 28. 
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sub-stellar point to be the hottest, but it is not. One possible explanation 
for this phenomenon is that exoplanetary winds re-distribute the heat 
on the planet. This may be the first evidence of exoplanetary weather. 

Itis worth mentioning that exoplanet occultations and phase variations 
of an exoplanet at optical wavelengths were first detected by CoRoT”. 
These types of measurements made by Spitzer (infrared), Kepler (opti- 
cal) and CoRoT (optical) have revealed that the albedo of most giant 
exoplanets is typically 10% or less, or comparable with that of the Moon. 
By comparison, Jupiter has an albedo of about 50%. Hot Jupiters are essen- 
tially dark and this is consistent with the early predictions from theoretical 
models of giant-planet atmospheres”. One exception to this is Kepler-7b, 
which has an almost Jupiter-like albedo of 0.32 (ref. 31). 

In 2009, Spitzer exhausted its liquid helium cryogen, effectively 
stopping all observations in the long-wavelength channels. However, it 
continues to make important contributions using the 3.6-um and 4.5- 
um channels in the ‘warm Spitzer’ mode. For example, warm Spitzer 
independently detected the transit of 55 Cnc e”’ and determined its 
brightness temperature of 2,360 K*. So far, Spitzer has measured the 
temperatures of several dozen exoplanets. 


Future space missions 

Space-based investigations of exoplanets are currently in transition. Two 
important missions have experienced hardware failures. CoRoT lost 
its two remaining charge-coupled-device detectors in November 2012. 
Kepler lost its second reaction wheel in August 2013, thus compromis- 
ing its ability to obtain the precise light curves needed for the detection 
of Earth-sized planets. Fortunately, the mission has been ‘retooled’ as 
Kepler 2, which will perform ‘step and stare’ observations on selected 
stellar fields in the ecliptic (see Review by Lissauer et al. on page 336). 

Other space missions have fallen victim to budget constraints. The 
Canadian Space Agency has chosen to cut funding for MOST and it 
seems likely that 2014 will be its last year of operation. Spitzer has also 
been earmarked for shutdown by NASA, owing to budget woes. Both 
missions are still capable of making considerable contributions to exo- 
planet studies. The exoplanet community will certainly feel the impact 
of the loss of both these facilities as they made unique and unprec- 
edented measurements that were just not possible from the ground. It 
is hoped that the respective agencies can find the monetary means to 
continue the operation of these important space missions. 

Despite these setbacks, the future of space-based studies is bright, 
owing to a series of upcoming missions that will focus on relatively 
bright stars. The characterization of transiting planets is clearly best 
done on bright host stars. Doppler measurements for the accurate deter- 
mination of the stellar mass requires high-resolution spectrographs, 
which can only be effectively used on relatively bright stars. Likewise, 
the detection of the reflected and radiated light from the planet and 
its atmospheric features require lots of photons to detect the minute 
signature of the atmosphere. Unfortunately, the transit planets discov- 
ered by CoRoT, Kepler and ground-based transit surveys are generally 
around faint stars, making characterization studies of transiting exo- 
planets challenging. Several approved space missions seek to remedy 
this situation. 

The Bright Target Explorer (BRITE)-Constellation mission of Aus- 
tria, Canada and Poland consists of six nanosatellites with telescopes 
of 3-cm aperture that will survey a few hundred of the brightest stars in 
the sky. BRITE Constellation may find transiting Neptune-sized planets 
around some of these stars. Nanosatellites are inexpensive and could 
find a cost-effective niche in exoplanet studies. 

The Characterising Exoplanets Satellite (CHEOPS), a partnership 
between the ESA and the Swiss Space Agency, will be a 32-cm telescope 
that will search for transits of bright stars with exoplanets found by Dop- 
pler surveys. Its launch is planned for 2017. 

NASAs Transiting Exoplanet Survey Satellite (TESS) will use four small 
telescopes to conduct a 2-year survey of nearby bright stars with V-magni- 
tude brighter than 12th. (By comparison most CoRoT and Kepler targets 
have V-magnitude fainter than 12th.) Each field will be monitored for 
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approximately 1 month before moving to the next field. The goal of TESS 
is to find targets for atmospheric studies using the James Webb Space 
Telescope (JWST). TESS is expected to be launched in 2017. 

The Planetary Transits and Oscillations** (PLATO) mission of the 
ESA will also monitor several hundreds of thousands of the bright- 
est stars using an array of 32 small telescopes. Its field of view will be 
about 20 times that of Kepler. Unlike TESS, PLATO will observe stars 
for several years so that terrestrial planets in the habitable zone can be 
discovered. Because the target stars are relatively bright, the Doppler 
measurement to determine the mass for these small planets will be sig- 
nificantly easier than it was for Kepler and CoRoT exoplanet candidates. 

An important aspect of PLATO will be its asteroseismic component. 
Accuracy in planet mass and radius is limited by our uncertainty in the 
stellar parameters. For instance, transit measurements only yield the 
ratio of the planet to stellar radii. The precision and length of the PLATO 
light curves will allow astronomers to study the stellar oscillations of 
the host stars using asteroseismology. Analogous to how seismology is 
used to study Earth, asteroseismology can exploit stellar oscillations to 
determine accurate stellar and thus planetary parameters. With PLATO 
and ground-based follow-up measurements the planet mass will be 
determined with an error of 10%, the planet radius to within 2% and 
the age of the star (and thus planet) to within 10%. The planet-mass 
density measurements of PLATO should be able to distinguish whether 
a terrestrial planet is Mercury-like (with a large iron core), Earth-like 
(with an iron core and silicate mantle) or Moon-like (mostly silicates). 
It will also be able to determine accurate stellar ages for the host stars so 
that we can study the evolution of planetary systems as well. PLATO is 
scheduled for launch between 2022 and 2024. 

Finally, JWST, successor to Hubble and Spitzer, is a 6.5-m telescope. 
It will be a significant improvement on Hubble and Spitzer for char- 
acterization studies of exoplanet atmospheres because of the larger 
aperture and the fact that it will be optimized for measurements in the 
infrared. JWST should be able to unambiguously detect the presence 
of water and methane in giant exoplanets as well as the atmospheric 
features of exoplanets as small as Neptune. This should establish 
whether the atmospheres of hot-Neptunes are truly featureless. It 
is scheduled for launch in late 2018 and has a nominal mission life 
of 5 years. JWST should be able to characterize the atmospheres of 
exoplanets discovered by TESS and possibly PLATO if JWST has an 
extended mission life. 

There are a number of other space missions that have been proposed 
and are currently in the study phase. A description of all of these is 
beyond the scope of this short Review. With the approved and planned 
missions it can be said that we are entering a golden age of space-based 
studies of exoplanets that will ensure that an already vibrant and excit- 
ing field becomes even more so in the future. The scientific results from 
MOST, CoRoT, Kepler, Hubble and Spitzer set the foundation for the 
planning and development of these future missions. We anticipate a 
treasure trove of exciting exoplanet discoveries in the next decade that 
might help to answer the question of just how unique the planets in our 
Solar System are, and in particular Earth. m 
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Instrumentation for the detection 
and characterization of exoplanets 


Francesco Pepe’, David Ehrenreich’ & Michael R. Meyer? 


In no other field of astrophysics has the impact of new instrumentation been as substantial as in the domain of exoplanets. 
Before 1995 our knowledge of exoplanets was mainly based on philosophical and theoretical considerations. The years that 
followed have been marked, instead, by surprising discoveries made possible by high-precision instruments. Over the past 
decade, the availability of new techniques has moved the focus of research from the detection to the characterization of 
exoplanets. Next-generation facilities will produce even more complementary data that will lead to a comprehensive view 
of exoplanet characteristics and, by comparison with theoretical models, to a better understanding of planet formation. 


therefore, not surprising that the first (unconfirmed) detection 

of an extrasolar planet arose through this technique’. In 1984, 
another detection ofa planetary-mass object around the nearby star VB 8 
was reported, this time using speckle interferometry’, but subsequent 
attempts to locate it were unsuccessful. It was finally Doppler velocimetry 
that delivered the first unambiguous detection of a very low-mass com- 
panion (HD 114762b’). However, because its minimum mass (11 M;,) 
is near the upper limit of the planetary mass range, the discoverers cau- 
tiously announced it was a brown dwarf. In 1992, a handful of bodies of 
terrestrial mass were found’ and confirmed, by the measurement of tim- 
ing variation, to orbit the pulsar PSR1257+12. Although very powerful, 
this technique was restricted to a small number of very particular hosts. 
Doppler velocimetry, instead, could be applied, with good results, to 
almost any ‘quiet’ star showing a reasonable amount of narrow absorp- 
tion lines in its spectrum. The continuous improvement of this technique 
led, in 1995, to the discovery of the first giant planet around the Sun-like 
star 51 Pegasi’ and marked the start of an intensive era of discoveries (see 
Review by Mayor et al. on page 328). 

Since the discovery of 51 Peg b, microlensing, transit searches and 
direct imaging has delivered, together with Doppler velocimetry, an 
increasing number of planets and planetary candidates. Better instru- 
ments and improved detection limits have pushed our capabilities 
towards the detection of low-mass and small planets. Furthermore, 
the discovery of multi-planetary systems is the direct consequence of 
long-term, high-precision programmes. A new breakthrough was made 
thanks to the space-based transits searches Convection, Rotation and 
Planetary Transits (CoRoT)° and Kepler’. These missions have made a 
significant contribution to the statistical study of exoplanetary systems. 

In this Review, we will discuss techniques and instruments that have 
contributed the most to our understanding of exoplanets. We will also 
provide an overview of present and future instrumentation, and describe 
how the field is moving from simple detection and statistical studies to 
the characterization of individual planets, their interior and their atmos- 
pheric composition. 


A strometry is the most ancient technique of astronomy. It is, 


Stellar radial velocities 

Giant planets on short orbits induce radial-velocity variations in their 
host stars of several tens to a few hundreds of metres per second. Early 
Doppler velocimeters*” delivered 200-500 m s"' precision. With the use 


of a hydrogen-fluoride absorption cell the precision could be improved 
by one order of magnitude’. In the late 80s and early 90s an entire suite 
of new techniques and spectrographs'*"“* led to an improvement of the 
radial-velocity precision down to 3-15 ms‘. This better precision led, in 
turn, to the discovery of 51 Peg b’ and the era of giant-planet detection. 

Would it be possible to detect terrestrial mass exoplanets by the 
Doppler technique? Some astronomers believed that improving the 
instrumental precision would be a key element’*. Confirmation of this 
belief was provided by the discovery of  Arae c in 2004 (ref. 16). At 
only 10 times the mass of Earth and with an orbit of 9.6 days, this planet 
produces a stellar radial-velocity pull of 3 ms ' semi-amplitude. The 
detection of this tiny signal required a new generation of spectrographs, 
such as High Accuracy Radial Velocity Planet Searcher (HARPS)”. It 
represented the first step towards the detection and characterization 
of a vast population of Neptune-mass planets and super-Earths. The 
longer the temporal coverage and the better the instrumental precision, 
the smaller the radial-velocity signals (see for example the detection of 
a Cen B b’’) that could be detected (Fig. 1). 

The Doppler measurement consists of determining the wavelength 
of an identified spectral line and comparing it with the theoretical value 
it would have when transferred into the Solar System's rest frame. The 
Doppler equation links the measurement to the theoretical wavelength 
by the relative-velocity vector, finally delivering the projection of this 
vector in the direction of the line of sight (radial velocity). To increase 
the precision, the average, over several thousands of spectral lines, is 
computed. It should be noted, however, that the radial-velocity meas- 
urement is affected by several potential error sources that have been dis- 
cussed extensively'?!*1???. The main error sources are: photon noise!*”!, 


instrumental errors'’*”’; spectrograph-illumination effects”; spectral 


contamination’*”*; and stellar ‘noise’, commonly referred to as stel- 
lar jitter. The term stellar jitter masks various stellar causes that produce 
radial-velocity effects at all timescales and of different magnitude. The 
discussion of all these effects lies beyond the scope of our Review. Nev- 
ertheless, it is important to be reminded that stellar jitter is probably 
the strongest limitation for Doppler velocimetry when aiming for sub- 
metre-per-second precision. 

Present and future Doppler spectrographs need to address the men- 
tioned limitations. As a first step, telescope size should be increased 
because high-spectral-resolution measurements are photon-starved, 
even for relatively bright targets. The gain obtained with a large telescope 
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is, however, easily lost if spectral resolution is low. In fact, for unresolved 
spectral lines the measurement precision increases significantly with 
increasing spectral resolution”. In the photon-noise-limited regime the 
error &,,,, on the line-centre measurement can be estimated by: 


15 


Evary = aaa (1- 5) 


where ois the measured width of the spectral line as seen through the 
spectrograph; c = (Ijin—Iy)/Iy is the measured line contrast; and EW = oc 
is the equivalent width. J, and [,,,, designate the photoelectron counts per 
resolution element in the continuum and the line minimum, respectively. 
It must be noted that the resolution element can be represented either by 
the detector pixel or by the wavelength unit as long as all the parameters 
are expressed in the same units. It is now commonly agreed that a spectral 
resolution of at least R = \/AA = 100,000 should be used to guarantee the 
best precision on slowly rotating, quiet, solar-type stars. Spectral reso- 
lution and adequate line sampling not only allow us to achieve better 
signal-to-noise per spectral line, but also to reduce possible instrumental 
errors in both the radial-velocity measurement and the calibration pro- 
cess. To first order approximation, instrumental errors scale with the size 
of the resolution element (expressed in wavelength units). Unfortunately, 
with increasing telescope size, spectral resolution is a considerable driver 
of cost. For seeing-limited instruments the optical etendue E(E=A x Q, 
the beam cross-section area times the solid angle) increases with the 
telescope size, and so does the instrument size if the spectral resolution 
is kept fixed*. In the era of 8-m class and extremely large telescopes 
(ELTs), this aspect has become a technical and managerial challenge 
that is nevertheless successfully addressed by employing novel optical 
design concepts”. 

All future projects for radial-velocity spectrographs (Table 1) aim to 
detect rocky planets in the habitable zone (the distance to the star at 
which liquid water can persist on the surface of the planet)“ of a Sun- 
like and a low-mass star. To attain this objective they must be photon- 
efficient and precise to the sub-metre-per-second level. Photon efficiency 
is obtained with optimized designs and high-spectral resolution. High 
precision also requires the control of all instrumental effects. State-of- 
the-art instruments are therefore designed to be stable”. Gravity invari- 
ance and illumination stability of the spectrograph are crucial aspects 
that can only be obtained through a fibre feed’. Despite the intrinsic 
light-scrambling properties of optical fibres*”' it was soon realised that 
the illumination produced by a circular optical fibre depends on how the 
starlight is fed into the fibre. In other words, motions of the stellar image 
at the fibre entrance would produce a change in the illumination of the 
spectrograph and mimic a radial velocity effect. Considerable effort was 
invested in improving image scrambling by using double scramblers'*” 
and octagonal fibres’. Effective improvements have already been dem- 
onstrated on operational instruments”. 

Any instrumental effect that produces a distortion or a shift of the 
spectral line in the detector-pixel space will be interpreted, if not detected 
and recognized, as a wavelength change and thus a Doppler shift”. Two 
methods of tracking the instrumental profile changes have success- 
fully been applied in the past. The first is to superimpose an absorption 
spectrum ofa reference gas cell’”'*”* on the stellar spectrum, such that 
the instrumental profile is continuously measured. This so-called self- 
calibration technique is particularly useful and effective in spectrographs 
with varying instrument profiles, as is the case for slit spectrographs. The 
disadvantages of this technique are the restricted bandwidth of the gas- 
cell spectrum, the loss of efficiency due to absorption in the light path, 
and the necessity for a sophisticated deconvolution process to recover the 
stellar spectrum and thus the radial velocity. This latter step requires the 
introduction of many additional parameters for spectral modelling. To 
obtain a given precision, higher signal-to-noise spectra must be acquired. 
The second method, the ‘simultaneous reference technique’’’””, is con- 
ceptually opposite. It assumes a stabilized instrumental profile that does 
not change between two wavelength calibrations of the spectrograph, 
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Figure 1 | Radial-velocity semi-amplitude of planetary-mass 
companions. All planets discovered by the Doppler technique from 1989 to 
present day are plotted. Remarkably, the detection limit improved by three 
orders of magnitudes in less than three decades. The underlying data were 
retrieved from http://www.exoplanets.org. 


such that the determined relationship between the detector pixel and 
the wavelength remains valid over these timescales (typically a night). 
A second channel carrying a spectral reference is continuously fed to the 
spectrograph to monitor and correct for potential instrumental drifts 
or instrument profile changes. It must be guaranteed, however, that 
the changes that the scientific and the reference channels are subject to 
are identical over the timescale of one observing night. Therefore, the 
whole design of the instrument must be optimized for stability of the 
instrumental profile, requesting fibre feed and light scrambling, as well 
as pressure, mechanical, thermal and optical stability. The effort is com- 
pensated for by an unrestricted spectral bandwidth and the acquisition 
of an ‘uncontaminated’ scientific spectrum. 

Although, in the case of the self-calibration technique, the instrument 
profile is supposed to be recoverable by deconvolution, there seems to be 
general agreement that low-order instrument-profile changes must, in 
any case, be avoided and that a stable instrument will eventually deliver 
more precise measurements. There is also agreement that better cali- 
bration sources are needed. The laser-frequency com 97-82. when avail- 
able at full potential, will provide the required calibration accuracy and 
precision. In the meantime, alternative sources are being developed, for 
example, passive Fabry—Pérot cavities © for simultaneous reference, 
or actively stabilized Fabry—Pérot systems for wavelength calibration”. 

The near-infrared wavelength region is becoming increasingly inter- 
esting for two other reasons. First, M dwarfs are much brighter in the 
infrared than in the visible”. These stars are cooler and thus their habit- 
able zone lies closer to the host star. In addition the parent star is less 
massive. Potential habitable planets are, therefore, more easily detected 
by radial velocity”. Second, the influence of spots is strongly reduced in 
the near-infrared compared with the visible”. Furthermore, a com- 
parison with radial-velocity determined in the visible wavelength range 
might help to discriminate a planet-induced velocity change from a stel- 
lar effect. For these reasons, many new instruments’'”° (Table 1) will 
operate in the infrared wavelength domain. The use of adaptive optics” 
could be a means of reducing the size and cost of these instruments. 


Transit photometry and spectroscopy 

There are two approaches to detecting planetary transits: surveying as 
many stars as possible with one or several photometers in the hope of 
detecting new exoplanets through their transits, and photometrically 
following up planets discovered by Doppler velocimetry around their 
predicted inferior conjunction time. (The inferior conjunction denotes 
the orbital configuration where the planet lies between its host star and 
the observer; a transit occurs at the inferior conjunction if the orbital 
plane of the planet is aligned with the line of sight.) In the first method, 
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Table 1| Non-exhaustive table of present (active) and future (approved) high-precision Doppler velocimeters 


Instrument/technique Telescope/observatory Start of operations Band (um) Spectral resolution Efficiency (%) Precision (ms?) 
Hamilton'®°/self-calibration Shane 3 m/Lick 1986 0.34-1.1 30,000-60,000 3-6 3 
UCLES?®/self-calibration 3.9-m AAT/AAO 1988 0.47-0.88  -100,000 NA 3-6 
HIRES?2/self-calibration Keck I/Mauna Kea 1993 0.3-1.0 25,000-85,000 6 1-2 
CORALIE’?/sim. reference EULER/ESO La Silla 1998 0.38-0.69 60,000 5 3-6 
UVES'®?/self-calibration UT2-VLT/ESO Paranal 1999 0.3-1.1 30,000-110,000 4-15 2-2.5 
HRS?®?/self-calibration HET/McDonald 2000 0.42-1.1 15,000-120,000 6-9 3-6 
HDS!®*/self-calibration Subaru/Mauna Kea 2001 0.3-1.0 90,000-160,000 6-13 5-6 
HARPS?*/sim. reference 3.6 m/ESO La Silla 2003 0.38-0.69 115,000 6 <08 
FEROS-II!8°/sim. reference 2.2 m/ESO La Silla 2003 0.36-0.92 48,000 20 10-15 
MIKE?®°/self-calibration Magellan II/Las Campanas 2003 0.32-1.00 65,000-83,000 and 22,000-28,000 20-40 5 
SOPHIE'®’/sim. reference 1.93 m/OHP 2006 0.38-0.69 39,000 and 75,000 4and8 2 
CRIRES'**/self-calibration UT1-VLT/ESO Paranal 2007 0.95-5.2 -100,000 15 5 
PFS'®/self-calibration Magellan II/LasCampanas 2010 0.39-0.67 38,000-190,000 10 1 
PARAS!°°/sim. reference 1.2 m/Mt. Abu 2010 0.37-0.86 63,000 NA 3-5 
CAFE!?/sim. reference 2.2 m/Calar Alto 2011 0.39-0.95 ~67,000 25 20 
CHIRON?°?/self-calibration 1.5 m/CTIO 2011 0.41-87 80,000 15 <l 
HARPS-N*/sim. reference TNG/ORM 2012 0.38-0.69 115,000 8 <l 
LEVY?*/self-calibration APF/Lick 2013 0.37-0.97. 114,000-150,000 10-15 <l 
EXPERT-III4/NA 2-m AST/Fairborn 2013 0.39-0.9* 100,000* NA NA 
GIANO’?/self-calibration TNG/ORM 2014 0.95-2.5 50,000 20 NA 
SALT-HRS'°°/self-calibration SALT/SAAO 2014 0.38-0.89* 16,000-67,000* 10-15* 3-4* 
FIRST'4/NA 2-m AST/Fairborn 2014 0.8-1.8* 60,000-72,000* NA NA 
IRD7?/sim. reference Subaru/Mauna Kea 2014 0.98-1.75* 70,000* NA 1 
NRES/NA 6 x 1-m/LCOGT 2015 0.39-0.86* 53,000* NA 3F 
MINERVA/self-calibration 4 x 1-m/Mt. Hopkins 2015 0.39-0.86* NA (Kiwispec)* NA [* 
CARMENES’“/sim. reference Zeiss 3.5-m/Calar Alto 2015 0.55-1.7* 82,000* 10-13* ts 
PEPSI!%°/sim. reference LBT/Mt. Graham NA 0.38-0.91* —120,000-320,000* 10* NA 
HPF’4/sim. reference HET/McDonald NA 0.98-1.40* 50,000* 4* 1-3* 
CRIRES+/self-calibration VLT/ESO Paranal 2017 0.95-5.2* = _-100,000* 15* <5* 
ESPRESSO**/sim. reference All UTs-VLT/ESO Paranal 2017 0.38-0.78* 60,000-200,000* 6-11* 0.1* 
SPIROU’°/sim. reference CFHT/Mauna Kea 2017 0.98-2.35* 70,000* 10* iL 
G-CLEF*?/sim. reference GMT/Las Campanas 2019 0.35-0.95* 120,000* 20* 0.1* 


For the spectral band and the spectral resolution the maximum value is given. The total efficiency 
precision was estimated from published orbits or standard star’s velocities. Historical instruments 


as been extrapolated to include slit losses, and telescope and atmospheric throughput. The radial-velocity 


have not been listed. It is interesting to note that mos 


of the planets discovered between 1995 and 2003 


were detected using a small number of precision instruments: High Resolution Echelle Spectrometer (HIRES) at the 10-m Keck i telescope in Hawaii, CORALIE at the European Southern Observatory (ESO) 


3.6-m telescope in La Silla, The Hamilton Spectrograph at the Shane 120-inch telescope at Lick, E! 


LODIE at the 1.93-m telescope of t 


e Haute-Provence 


Observatory’’, Advanced Fiber-Optic Echelle (AFOE) on 


the 1.5-m telescope at the Whipple Observatory!” University College London Echelle Spectrograph (UCLES) at the Anglo-Australian Telescope (AAT), Coudé Echelle Spectrograph!” on the 2.7-m telescope, 


the Sandiford Cassegrain Echelle spectrograph’”* on the 2.1-m telescope and the High-Resolution 


Spectrograph (HRS) at the 


Hobby-Eberly Telescope (| 


HET), all of them at the McDonald Observatory. After 


2003 the HARPS spectrograph opened a new window on the domain of super-Earths and mini-Neptunes by improving the radial-velocity precision below the metre-per-second level. Since then, the metre- 
per-second precision has become a ‘standard’ and a goal for most of the Doppler-velocimeter projects presented in the table. AAO, Australian Astronomical Observatory; APF, Automated Planet Finder; AST, 
Automatic Spectroscopic Telescope; CAFE, Calar Alto Fiber-fed Echelle; CARMENES, Calar Alto High-Resolution Search for M dwarfs with Exo-Earths with Near-Infrared and Optical Echelle Spectrographs; 
CFHT, Canada-France-Hawaii Telescope; CTIO, Cerro Tololo Inter-American Observatory; ESPRESSO, Echelle Spectrograph for Rocky Exoplanet and Stable Spectroscopic Observations; EXPERT-III, Extremely 
High Precision Extrasolar Planet Tracker Ill; FEROS-II, Fiberfed Extended Range Optical Spectrograph; FIRST, Florida Infrared Silicon Immersion Grating Spectrometer; G-CLEF, GMT-CfA Carnegie, Catolica, 
Chicago Large Earth Finder; GMT, Giant Magellan Telescope; HPF, Habitable-zone Planet Finder; HDS, High Dispersion Spectrograph; IRD, Infrared Doppler; LBT, Large Binocular Telescope; LCOGT, Las 
Cumbres Observatory Global Telescope Network; MINERVA, Miniature Exoplanet Radial Velocity Array; MIKE, Magellan Inamori Kyocera Echelle; NRES, Network of Robotic Echelle Spectrographs; OHP, 
Observatoire de Haute Provence; ORM, Observatorio del Roque de los Muchachos; PARAS, PRL Advanced Radial-velocity All-sky Search; PEPSI, Potsdam Echelle Polarimetric and Spectroscopic Instrument; 
PFS, Planet Finder Spectrograph; SAAO, South African Astronomical Observatory; SALT, Southern African Large Telescope; SOPHIE, Spectrographe pour |’Observation des Phénoménes des Intérieurs 
Stellaires et des Exoplanétes; SPIROU, SpectroPolarimétre Infra-Rouge; TNG, Telescopio Nazionale Galileo; UT, Unit Telescopes; UT2-VLT, Unit Telescope 2-Very Large Telescope; UVES, Ultraviolet and Visual 
Echelle Spectrograph. NA, not available or non-reliable information; sim. reference, simultaneous reference technique. 


*Indicates design values. 


the expected depth of the transit light curve dictates the photometric 
precision needed — for Jupiter-sized planets in transit across Sun- 
like stars the transits can be detected from the ground with amateur 
telescopes. Hot Jupiters, however, are only found orbiting about 1% of 
nearby solar-type stars”’, requiring observers to maximize the number of 
surveyed stars. Bright main sequence stars can be surveyed over a large 
fraction of the sky by wide-field cameras with small aperture telescopes 
and charge-coupled devices (CCDs), as illustrated by the Wide Angular 
Search for Planets” (WASP). Observations from a single location are 
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limited, however, by the duration of the night. Time and sky coverage 
can be further improved with networks of small telescopes that relay 
from different longitudes, such as the Hungarian Automated Telescope 
Network* (HATNet) or the Trans- Atlantic Exoplanet Survey” (TrES). 
The other strategy is to stare at crowded stellar fields. The 1.3-m tele- 
scope of the Optical Gravitational Lensing Experiment (OGLE) yielded 
the first discoveries of exoplanets through the transit method” by apply- 
ing this strategy. The confirmation of these detections with velocimetry”, 
however, required a large observational effort because of the faint optical 
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magnitudes (denoted V) of the stars surveyed (V = 14-16 mag). The 
first space missions dedicated to the search for transiting exoplanets, 
CoRoT®™ and Kepler’, also stared at dense fields with high-cadence 
precise (relative) photometry (see the Reviews by Hatzes on page 353 
and Lissauer et al. on page 336). Together, these satellites have surveyed 
several hundred thousand stars. Radial-velocity follow-up of CoRoT 
and Kepler exoplanet candidates remains difficult owing to the faint 
magnitudes of the host stars and the large number of targets needing 
follow-up. The faintness of the host stars also sets severe limits on the use 
of photon-starved techniques, such as transmission spectroscopy for the 
study of the planetary atmospheres. This technique requires bright host 
stars (Fig. 2), such as the hosts of planets discovered through velocimetry 
and later detected in transit. Only nine such exoplanets are known so 
far, but future space missions will search for more of these planets. In 
the meantime, and from the ground, planets transiting small stars such 
as M dwarfs are being looked for, because the transit signal is inversely 
proportional to the square of the stellar radius. The MEarth survey”, 
composed of eight identical robotically controlled 40-cm telescopes 
with CCD detectors, found a super-Earth* that is especially amenable 
to follow-up atmospheric studies” ”’. 


Studies of exoplanetary atmospheres 

The hot gas giant HD 209458b was the first exoplanet captured in transit 
by two separate small telescopes”, with a relative photometric preci- 
sion of 0.2-0.4%. This transit was also the first exoplanet-related event 
observed from space: the 2.4-m Hubble Space Telescope measured the 
transit light curve to a precision of 110 p.p.m. per minute of observa- 
tion”. The photometric observations of HD 209458b were obtained by 
integrating the stellar spectra collected before, during and after the transit 
by the Space Telescope Imaging Spectrograph (STIS)** CCD detector. 
These spectra were recorded with a medium-resolution (R = 5,540) 
grism of medium band pass, notably including the sodium doublet at 
589 nm. The first transmission signature of an exoplanetary atmosphere 
was reconstructed from this data set by measuring, during the transit, an 
extra absorption of 200 p.p.m. in the sodium lines”*. The far-ultraviolet 
channel of the STIS instrument, which collects ultraviolet photons with a 
multi-anode microchannel array (MAMA) detector, was used to observe 
the transit of HD 209458b over the stellar Lyman-a emission of atomic 
hydrogen at 121 nm. These measurements led to the discovery of an 
extended upper atmosphere to the planet”. 

HD 209458b remained, for quite some time, the only known transiting 
exoplanet. By the time additional transiting exoplanets were announced 
(in 2004), STIS had experienced a power-supply failure. The instrument 
was only repaired in 2009 during the last servicing mission of Hubble. 
Arguably, the main effect of the STIS failure was to shift the field of exo- 
planetary atmospheres into the infrared. After 2004, and despite suc- 
cessful attempts to record precise transit light curves with the Advanced 
Camera for Surveys on board Hubble”, the 85-cm Spitzer Space Tel- 
escope became the prime observatory not only for transits, but also for 
eclipses of planets by their stars, which can occur at superior conjunc- 
tions (the orbital configuration opposite the inferior conjunction, when 
the planet passes behind the star). Broadband photometry of these 
eclipses with the Infrared Array Camera ([RAC)” on Spitzer revealed 
the thermal emission from exoplanets, the first example of direct detec- 
tion of light from a planet orbiting a star’. The instrument has four 
broadband infrared channels collecting light on two detectors made of 
indium antimonide (3.6 ttm and 4.5 um channels) and arsenic-doped 
silicon (5.8 pm and 8.0 wm channels). 

The first infrared observation of a planetary transit” was obtained 
with the Multiband Imaging Photometer for Spitzer (MIPS) at 24 um. 
These observations were limited by the low stellar flux in the mid-infra- 
red. Furthermore, transit observations in the near infrared exhibited 
large instrumental effects, precluding the detection of molecular sig- 
natures. Both photometry with IRAC’™” and spectroscopy with the 
Near-Infrared Camera and Multi-Object Spectrometer (NICMOS)’” on 
Hubble yielded non-reproducible results or were of insufficient quality 
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Figure 2 | Detectability of planetary atmospheres. The signal of one 
atmospheric scale height seen in transmission during transit is plotted against 
the stellar J magnitude. The signal is calculated in parts per million (p.p.m.) as 
2 x 10° (AF/F) (H/R,), where AF/F is the transit depth and H is the atmospheric 
scale height. This quantity scales here with the equilibrium temperature of 
the planet and is inversely proportional to the acceleration of gravity at the 
surface of the planet and the mean molar mass (1) of the atmosphere. The 
atmospheric signal is proportional to the planet mean density. The size of 
the circles (and the thickness of the colour bars) scales with the density to 
show this effect. For giant exoplanets (brown circles), the atmosphere is 
assumed to be primarily composed of molecular hydrogen (H,) and helium 
(w=2.3¢ mol’). For lower-mass planets, such as Neptunes (10 < M,<60M,, 
where M, is the mass of Earth and M, is the mass of the planet) and super- 
Earths (M, < 10 M,), the atmospheric composition is unknown and the colour 
bar extents represent all possible signal values assuming hydrogen and helium 
(= 2.3 g mol ', brown) and water (H,O, 1. = 18 g mol ', blue) dominated 
atmospheres for Neptunes, and molecular nitrogen (N,, 1. = 28 g mol ', light 
blue), and carbon dioxide (CO,, p= 44 g mol”, red) dominated atmospheres 
for super-Earths, in addition to the two earlier types. Approximate Hubble 
Space Telescope (HST) and JWST 3-o detection limits (orange and red 
lines, respectively) are shown. Only super-Earths and Neptunes with a mass 
determined to better than 20% are represented. 


for unambiguous interpretation’ "””. Eclipse spectroscopy of the dayside 
emission of HD 189733b obtained with the third instrument on Spitzer, 
the Infrared Spectrograph (IRS)! providing low-resolution (R = 80) 
and spectral coverage from 5 um to 14 tm, also had to be corrected for 
instrumental effects'’*. The IRS data nonetheless provided evidence for 
molecular absorption in an exoplanet atmosphere’. Unfortunately, the 
use of IRS was terminated after Spitzer ran out of cryogen in May 2009. 
Meanwhile, Spitzer continues observing with IRAC 3.6-um and 4.5-um 
channels, now commonly used to obtain precise transit light curves of 
exoplanets down to the super-Earth size regime’’*"””. 

Ground-based atmospheric characterization of exoplanets advanced 
through the use of high-resolution spectrographs. The signature of 
sodium in the atmosphere of HD 209458b was found""* in data taken 
with the High Dispersion Spectrograph (R = 45,000) at the Subaru 8-m 
telescope’””. The technique, differential spectroscopy, involves calibrat- 
ing the signal in the spectroscopic features with the continuum signal 
in the vicinity of the features. The ‘absolute’ transit depth is lost, but the 
transmission signal can be retrieved, assuming that telluric absorption 
can be sufficiently calibrated. Another method is to calibrate the wave- 
length-dependent signal using other stars within the field of view of the 
instrument. This can be achieved in spectrophotometry for systems with 
nearby reference stars’’*"”” or in spectroscopy with slit masks positioned 
on the target and on several reference stars in the field”. A breakthrough 
was made possible by the Cryogenic High-Resolution Infrared Echelle 
Spectrograph (CRIRES) on the Very Large Telescope (VLT). Its high reso- 
lution (R = 100,000), although over a narrow (50 nm) wavelength infrared 
region, allows tracking of the wavelength shift of individual spectral fea- 
tures composing molecular bands of water, carbon monoxide or carbon 
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Figure 3 | Mass and semi-major axis of known planets. Planetary mass is 
plotted as a function of the semi-major axis (the distance to the host star). 
Solar-system planets are shown by black circles (Me, Mercury; V, Venus; Ma, 
Mars; J, Jupiter; S, Saturn; U, Uranus; N, Neptune) and Earth (E) is in blue. 
Exoplanets detected with different techniques and instrumentation are shown. 
Doppler velocimetry (white), transit with a measured mass (orange), direct 
imaging (blue), microlensing (pink), and pulsation timing (green). Among 
the direct-imaging planets, only ten (labelled) were found within 100 au of 
their host and a mass ratio between the companion and its host star q < 0.02. 
Data underlying this plot were retrieved from the Exoplanet Encyclopaedia”. 


dioxide that are present in the atmosphere of the planet as the planet orbits 
the star’””’”. The method works for transiting and non-transiting planets 
alike, giving access to the brightest exoplanetary systems, such as that of 
t Bodtis™*. Its application to the directly imaged planet B Pictoris b’* led 
to the determination of the spin velocity of the planet’”. 

The refurbishment of Hubble in May 2009 enabled the recovery of 
STIS capabilities and the start of operations of both the Cosmic Origins 
Spectrograph’”° (COS) in the far-ultraviolet and the Wide-Field Cam- 
era-3 (WFC3)'” in the near-infrared. COS and STIS provided observa- 
tions and confirmation of the atmospheric mass loss from HD 209458b 
in the singly ionized carbon lines at 133 nm’. These measurements were 
extended to other exoplanets’”’®*. Visible STIS spectra revealed atomic 
signatures and the presence of light scattering processes in the upper 
atmospheric layers of HD 189733b’”””™, and observations of the eclipse 
of HD 189733b with a low-resolution grating from 290 nm to 570 nm 
also yielded the first chromatic measurements of a planetary albedo’. 
The WFC3 infrared channel was successfully used for slitless grism spec- 
troscopy of exoplanetary transits in the near-infrared, achieving near- 
photon-noise transmission spectroscopy of super-Earths, Neptunes and 
gas giants between 1.1 umand 1.7 pm”’**", and detecting the 1.38-um 
water band in some of these planets. 


Direct imaging and astrometry 

Despite many years of technological development, the search for ideal 
targets, improved analysis algorithms and investment in observing time 
on leading telescopes, it was not until 2008 that the first direct images of 
an exoplanetary system around a star were obtained. The multi-planet 
system HR 8799, with all planets’ orbiting the intermediate mass host 
star in the same rotation sense, was a remarkable (and a lucky) break- 
through. Interpretation of the contemporaneous discovery of a faint 
point source around the debris disk host star Fomalhaut"” has turned out 
to be more complex than anticipated™*. Finally, at the end of 2008, a giant 
planet was found around  Pic'**'*”"” within the prototypical debris disk 
first imaged in the early 1980s”. These discoveries were preceded by 
several others (some of which were spurious), often around very young 
objects still in the process of becoming a star. For example, the compan- 
ion to 2MASSWJ 1207334-393254 (a very young brown dwarf) was dis- 
covered’ through adaptive-optics-assisted near-infrared imaging with 
the NACO instrument on the VLT. This discovery was notable because 


150 


362 | NATURE | VOL 513 | 18 SEPTEMBER 2014 


the system is very young, making detection ofa self-luminous planetary 
mass object easier; the central host object is of very low mass and thus of 
modest luminosity relative to the planetary mass companion; and NACO 
is equipped with an infrared wave-front sensor, which is important to 
allow observations of this class of cool primaries. However, the mass 
ratio (q) of the brown dwarf to the companion is consistent with many 
examples of binary star systems of higher mass. So far, there have been 
10 objects found within 100 au of their host with a mass ratio between 
the companion and the host star of q < 0.02 (Fig. 3; http://exoplanet.eu/). 
These restrictions suggest that they may have formed like planets in our 
Solar System, but this is not at all certain. There are dozens of objects that 
have larger mass ratios (particularly around very low-mass primaries), 
as well as objects with low-mass ratios, but found at larger radii (out 
to more than 1,000 au). One major caveat to these studies is that the 
masses are inferred from theoretical models’” based on the shape of the 
spectral energy distribution and luminosity, as well as knowledge about 
the central star (primarily age, but also composition). 

State-of-the-art instruments require advanced adaptive optics to 
correct for the blurring effects of Earth’s atmosphere’. Although the 
diffraction limit improves at shorter wavelengths, high performance 
adaptive optics are more challenging, leading to compromises for instru- 
ment design between 0.5-5.0 um. Even at the diffraction limit of an 8-m 
class telescope, it is only possible to reach orbital separations of 3 au 
at 1.65-um wavelength for stars out to a 50 pc distance. The younger a 
planet is, the hotter and brighter it is, making its detection and charac- 
terization easier. Nearby stars tend to be old (1-3 gigayear) and the 
youngest objects, which are more rare, are located at greater distances. 
Thus, another compromise needs to be found between available target 
sample and ease of detection, which translates directly into a balance 
between detectable mass (better for younger, more distant objects) and 
orbital separation (better for nearby stars). Results so far suggest that 
massive gas-giant planets (> 2 M,) are rare at large orbital radii (for 
example, beyond 50 au). However, new instruments utilizing extreme 
adaptive optics (resulting in an increase of hundreds to thousands of 
actuators controlling the shape of the deformable mirror, for example, the 
Spectro-Polarimetric High-Contrast Exoplanet Research (SPHERE)’” 
instrument and the Gemini Planet Imager (GPI)'**) will improve the 
inner working angle that can be reached at all wavelengths of operation, 
although in particular it will open up the possibility of Strehl ratios above 
30% in the red visible’. It is also worth mentioning that great improve- 
ments in data acquisition modes and analysis software (differential imag- 
ing through angular, polarimetric and spectral difference) have greatly 
enhanced planet-detection capabilities'*"”. In addition, the develop- 
ment of diffraction-suppression optics continues — as observations are 
contrast-limited close to the star. In the photon-noise limit, which is not 
often reached even around early type bright stars, sparse aperture mask- 
ing'® and coronography can improve the achievable contrast limit using 
techniques such as apodizing phase plates, vector vortex'”, phase- 
induced amplitude apodization’® and classical Lyot coronography’™. 
Marked improvements in diffraction suppression, stability and quality 
of adaptive optics, as well as in post-processing algorithms, are needed 
to reach the fundamental background-limited sensitivity close to the 
diffraction limit. The inner working angle, at which the background 
limit is reached, is 10 times larger than the diffraction limit. The imple- 
mentation of low-noise infrared wave-front sensors is another key area 
of development, particularly in their application to imaging surveys of 
fainter lower-mass stars and brown dwarfs. Building the observational 
data to constrain the frequency of planets as a function of planet mass, 
orbital separation and primary-star mass will provide powerful tests for 
theories of planet formation. 

The James Webb Space Telescope (JWST) will launch in 2018 and 
will provide powerful capabilities for direct imaging, including corona- 
graphy. All of its instruments will make great contributions to finding 
and characterizing exoplanets resolved from their host stars, including 
some of those already known today. In particular, its short-wavelength 
imager, Near Infrared Camera (NIRCam), will be able to detect planets 


© 2014 Macmillan Publishers Limited. All rights reserved 


below the mass of Saturn beyond 30 au around close-by stars. The 
Near-Infrared Imager and Slitless Spectrograph (NIRISS) will utilize a 
sparse aperture mask to detect bright companions below the diffraction 
limit at 1-2.3 um wavelength. It will be particularly useful for surveys 
of very young stars for which planetary companions will be brightest 
relative to the central star. The Mid-Infrared Instrument (MIRI), the 
long-wavelength camera/spectrograph on JWST, will provide additional 
characterization of planetary atmospheres from 5 jum to 28 tum, and the 
Near-Infrared Spectrograph (NIRSpec, 1-5 jum) will be equipped with 
an integral field spectrograph that is capable of providing high-quality 
spectra of close companions. 

Although JWST will be the most powerful telescope ever in terms 
of infrared sensitivity, it will not provide enhanced spatial resolution 
compared with the current generation of 6-10-m telescopes and will not 
provide unique capabilities for high-contrast imaging at inner working 
angles below 0.1 arcsec. Because we know that the distribution of giant 
gaseous planets rises with orbital radius out to 3 au, and because mas- 
sive gas giants are rare beyond 30 au, it is likely that most Jupiter-mass 
planets will be found at intermediate separations. The next generation 
of ELTs will enable us to cross the 10 au threshold in angular resolution 
of accessible targets, pushing the detectable separation down to 3 au and 
enabling vast synergies between Doppler velocimetry and astrometry. 
The Large Binocular Telescope Interferometer (LBT1) is the first optical 
telescope with an effective resolution of a 22.8-m baseline’, although 
it is nota filled aperture, thus limiting its sensitivity. The European ELT 
(E-ELT), with its aperture of 39 m, will integrate a suite of imaging and 
spectroscopic instruments (HARMONI, MICADO, METIS and even- 
tually EPICS) to enable efficient imaging of exoplanets at diffraction- 
limited inner working angles below 0.1 arcsec. Similar instruments are 
planned for the two other ELT projects — the Thirty Meter Telescope 
(TMT) and the Giant Magellan Telescope (GMT). Considerable thought 
is being devoted to working out how to reach the fundamental back- 
ground limit when approaching the diffraction limit, which, along 
with sensitivity and spatial resolution of these ELTs, would represent a 
major breakthrough. The removal of so-called quasi-static speckles is 
the key; this can be achieved, in principle, through sophisticated cali- 
bration schemes for adaptive-optics systems to enable the commanded 
removal of speckles, or equally by sophisticated analysis of the wave- 
front sensor camera data and telemetry to analyse residual errors in post- 
processing”. In predicting the performance of these future telescopes 
some take a conservative approach, whereas others believe they could 
reach the ultimate limit. Either way, these ELTs will represent a huge 
breakthrough in the capacity to directly image planets around nearby 
stars. If the technological challenges are mastered, the E-ELT will have 
a reasonable chance of obtaining a direct image ofa super-Earth within 
1 au of the nearest stars’”. 

In late 2013, the European Space Agency launched Gaia, which has the 
ability to reach micro-arcsecond astrometric precision. Owing to better 
performances, this mission will allow the exploration ofa wider parameter 
space to detect motions in the plane of the sky due to the orbit of the host 
star and planet around a common centre of mass. As the precision will 
fall for fainter stars, Gaia will be sensitive to the lowest-mass planets only 
around stars in the solar neighbourhood, but will detect hundreds — if 
not thousands — of gas-giant planets within hundreds of parsecs'”. Fur- 
thermore, this will open up the synergistic possibility to directly image 
some of these objects, providing ground-truth for models of their evolu- 
tion. Ground-based astrometry will also play a part, as could JWST and 
other facilities. For instance, ground-based direct imaging could deliver 


astrometric measurements at 100 micro-arcsecond precision’”. 


A bright and multi-technique future 

In the past, our focus was on discovering new exoplanets and acquiring 
statistics about their diversity, which, in turn, concerned mainly external 
(orbital) parameters (Fig. 3). Now, interest is moving towards the detailed 
characterization of specific planets and planetary systems. Orbital 
parameters, host-star characteristics, synchronization and planetary 
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spin, irradiation, planet density and internal structure, atmospheric 
composition, and physical conditions must be characterized in order 
for us to understand the formation processes and the observed diversity. 

Increasing the number of targets amenable to further characteriza- 
tion is the prime goal of several dedicated space mission projects: the 
extension of the Kepler mission (K2)'”’, NASA’ Transiting Exoplanet 
Survey Satellite (TESS)'”* and ESA’s Planetary Transits and Oscillations 
of Stars mission (PLATO) will obtain photometric measurements 
of bright stars located almost everywhere in the sky, and thus find 
many new transiting planets around bright stars. These space missions 
will be complemented by new ground-based surveys dedicated to the 
search for transits across different types of stars, for example, the Next 
Generation Transit Survey (NGTS), the Search for Habitable Planets 
Eclipsing Ultra-cool Stars (SPECULOOS), the Exoplanets in Transit 
and Their Atmosphere (ExTrA) and the Multi-Site All-Sky Camera 
(MASCARA)"”. 

The planets transiting bright stars will enable follow-up observa- 
tions and characterization of the planets by other techniques. The 
Swiss-ESA spacecraft CHEOPS (Characterising Exoplanets Satel- 
lite)'””"”* will, by transit photometry, measure precise radii and bulk 
densities of known exoplanets and select the best-suited targets for 
atmospheric characterization by future spectrographs from space or 
on large ground-based telescopes. JWST will have unprecedented 
thermal infrared sensitivity. Its four instruments will, in addition to 
the direct imaging of planets, attempt transit observations at low-to- 
medium-resolution (R = 100-1,500) in the near- and mid-infrared 
domain for atmospheric characterization. Whereas several of the 
known hot gas giants will be amenable to detailed studies with JWST 
(Fig. 2), additional low-mass targets, Neptunes, super-Earths and 
Earth-like planets, will be delivered by TESS, CHEOPS and PLATO. 

Atmospheric characterization of transiting and non-transiting exo- 
planets has already been initiated with current ground-based direct 
imaging and resolved spectroscopy as well as high-resolution spec- 
trographs (for example, CRIRES and HARPS). These capabilities will 
be considerably extended with the upcoming generation of visible 
and near-infrared instruments equipping 4-m to 8-m telescopes. The 
advent of ELTs such as E-ELT, TMT and GMT, in combination with 
high spatial and spectral resolution, will amplify this tendency and 
open up a new parameter space, for instance by enabling the detection 
of bands of molecular oxygen on super-Earths transiting M dwarfs'”’. 

Techniques such as radial-velocity, photometry, astrometry, imag- 
ing and spectroscopy will all contribute to the field of exoplanets. 
Whereas in the past the groups using these techniques seemed to be in 
competition, now, in view of achieving a comprehensive understand- 
ing of the ‘new worlds’ we are looking for, the results they produce are 
highly complementary. The more mature the field becomes, the more 
we understand that we will not find another Earth with one single mis- 
sion, but only with the combination ofall the tools that are offered to 
us over the next decades. m 
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Multifunctional organoboron compounds 


for scalable natural product synthesis 
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Efficient catalytic reactions that can generate C-C bonds enantioselectively, and ones that can produce trisubstituted 
alkenes diastereoselectively, are central to research in organic chemistry. Transformations that accomplish these two 
tasks simultaneously are in high demand, particularly if the catalysts, substrates and reagents are inexpensive and if the 
reaction conditions are mild. Here we report a facile multicomponent catalytic process that begins with a chemoselective, 
site-selective and diastereoselective copper-boron addition to a monosubstituted allene; the resulting boron-substituted 
organocopper intermediates then participate in a similarly selective allylic substitution. The products, which contain a 
stereogenic carbon centre, a monosubstituted alkene and an easily functionalizable Z-trisubstituted alkenylboron group, 
are obtained in up to 89 per cent yield, with more than 98 per cent branch-selectivity and stereoselectivity and an enan- 
tiomeric ratio greater than 99:1. The copper-based catalyst is derived from a robust heterocyclic salt that can be prepared 
in multigram quantities from inexpensive starting materials and without costly purification procedures. The utility of 
the approach is demonstrated through enantioselective synthesis of gram quantities of two natural products, namely 


rottnestol and herboxidiene (also known as GEX1A). 


Enantioselective processes where a catalyst unites a pair of starting mate- 
rials and then induces the resulting species to react with a third substrate 
are sought-after in chemistry'*. Pathways that involve difficult-to-access 
intermediates and products then become feasible, and wasteful and costly 
procedures for isolation and/or purification of sensitive reagents become 
unnecessary’. Rare instances of such multicomponent processes can 
be found in phosphine-Ir or Ru-catalysed enantioselective reductive 
fusion of hydrogen, unsaturated hydrocarbons and carbonyl or imine 
compounds**. An unprecedented degree of complexity would result ifa 
multitasking catalyst were to promote several transformations that are each 
selective on multiple levels, with the final product bearing the marks of every 
single discriminatory event; a representative pathway is shown in Fig. la. 


Multicomponent synthesis of complex fragments 
Boron-substituted alkenes are widely used multipurpose moieties. Single- 
catalyst/multisubstrate transformations that deliver multifunctional unsat- 
urated organoboron compounds are therefore of great interest. In the 
first phase of our studies (Fig. 1b), we found that chemoselective addi- 
tion of (phosphine)Cu-B(pin) (here B(pin) = (pinacolato)boron), derived 
from reaction of an in situ generated (phosphine)Cu-alkoxide with 
B,(pin)2, toa monosubstituted allene (versus an aldehyde) affords 2-B(pin)- 
substituted allylcopper complex i, which then reacts with an aldehyde 
(versus an allene) to afford homoallylic alkoxide iii. An assortment of 
aldol-type products were obtained after oxidative treatment in up to 
>99:1 diastereomeric ratio (d.r.) and 97:3 enantiomeric ratio (e.r.)°. In 
contrast, transformations with N-heterocyclic carbene (NHC) complexes 
of copper, while efficient, generated racemic products. 

The above reactions give 1,1-disubstituted alkenylboron units because 
of a second-stage y addition (ii), which causes the loss of an important 
attribute of the initially formed intermediate (i): a stereochemically defined 
and modifiable trisubstituted olefin. A multicomponent catalytic enan- 
tioselective process that preserves the trisubstituted alkenylboron group 
would have higher value. We thus envisioned a transformation involv- 
ing chemo-, site- and stereoselective Cu-B(pin) addition to an allenyl 


substrate followed by chemo- and site-selective (branched versus linear) 
cross-coupling of the resulting allylcopper species through enantiose- 
lective allylic substitution (EAS). The envisioned catalytic sequence would 
furnish multifunctional organoboron products v by a single operation; 
this would be in contrast to the existing strategies where each functional 
unit must be installed individually through extended and less efficient 
sequences’*® (for a complete bibliography, see Supplementary Informa- 
tion). Such a process would bea significant addition to an important but 
limited group of catalytic allyl—allyl reactions. Site- and enantioselective 
incorporation of allyl groups through catalytic EAS has been confined 
to simple fragments introduced via allylboron’, allylmagnesium’? or 
allylic alcohol’* compounds (see Supplementary Information for a com- 
plete bibliography). 

The expected organoboron products (v, Fig. 1b) are rich in adaptable 
moieties. A stereogenic centre could be formed in the homoallylic posi- 
tion of a stereochemically defined trisubstituted alkenylboron unit that 
may be converted to other E- or Z-trisubstituted olefins. For instance, 
conversion of the C-B(pin) of v to a C-C bond with inversion of stereo- 
chemistry would deliver vi, which is a functional group found in numer- 
ous biologically active molecules; a notable case corresponds to a segment 
of immunosuppressive agent FK-506"* (compare highlighted fragment 
in Fig. 1c). Efficient and stereoselective synthesis of such trisubstituted 
olefin-containing fragments remains a difficult problem. In previous 
efforts either the undesired Z olefin was removed from a near-equal mix- 
ture of isomers'*"*, or modification of a terminal alkyne by relatively 
lengthy routes was required’. The terminal olefin of the products is 
an asset as well: it would provide the opportunity for many types of 
modifications. One example entails conversion to an E,E-diene by sequen- 
tial catalytic cross-metathesis with vinyl-B(pin)’* and cross-coupling 
(Fig. 1c)'’, generating a fragment that is common to several biologically 
active natural products. The highlighted segments in nafuredin (NADH- 
fumarate reductase inhibitor’*”’), milbemycin 3 (insecticidal”’), rott- 
nestol (member ofa family of antibiotics’) and herboxidiene (phytotoxic, 
anti-tumour”’) are representative. 


1Department of Chemistry, Merkert Chemistry Center, Boston College, Chestnut Hill, Massachusetts 02467, USA. 
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Figure 1 | Multicomponent catalytic enantioselective generation of 
alkenylboron compounds. a, The general scheme for a multicomponent 
catalytic cycle involving a reagent and two substrates might be envisioned to 
proceed by a sequence entailing multiple selectivity issues. Ideally, all points of 
selectivity would be retained. b, Catalytic stereoselective generation of an 
alkenyl-B(pin) intermediate (i), which might react in situ site-, diastereo- and 
enantioselectively with an aldehyde or an allylic phosphate to generate valuable 


Catalyst identification and method development 


Successful implementation of the aforementioned plan demands high 
chemoselectivity despite the involvement of two C-C x bonds (that is, 
Cu-B(pin) addition to allene versus allylic phosphate). Monosubstituted 
allenes” as well as allylic carbonates” have indeed been shown to undergo 
efficient reactions with copper-boron complexes. Allenes are compar- 
atively unhindered and might react with a Cu-B(pin) complex more 
readily, but the Lewis basic phosphate can associate with a transition metal 
to set offan undesirable sequence of events. Another strategic element 
is that reaction of the allylcopper intermediate with the allylic phosphate 
must be followed by a facile reductive elimination (iv, Fig. 1b); this way, 
the trisubstituted alkenylboron unit would be retained and the chiral, 
branched product isomers would be formed preferentially (that is, 3a 
favoured over 4-6; see Table 1). 


368 | NATURE | VOL 513 | 18 SEPTEMBER 2014 


OH 


vi Catalytic cross-coupling 


Me Me 
Nafuredin 
HO 
\ x NN onLe® 
Me Me OH 
Rottnestol 


Me Me Me OH 
Herboxidiene/GEX1A 


multifunctional products. In the second proposed sequence, each point of 
selectivity, especially the trisubstituted alkene, would be preserved within the 
final structure. c, Sequential catalytic Cu-B(pin) addition/enantioselective 
allylic substitution, affording products represented by v, constitutes an 
attractive strategy for synthesis of biologically active compounds. NHC, 
N-heterocyclic carbene; B(pin), (pinacolato)boron; G, functional group. 


To identify conditions that would deliver 3a in favour of 1,1- 
disubstituted alkenyl-B(pin) 4, achiral 5 or diene 6 (Table 1), we selected 
the reaction involving allene 1a and allylic phosphate 2a. We soon found 
that, unlike reactions with aldehydes* (Fig. 1b), a phosphine-Cu complex 
is ineffective (for example, with PCy3 and 7”, entries 1 and 2, Table 1), 
and bis-phosphine-derived catalysts cause only the allylic phosphate to 
be consumed (for example, with complex derived from 8, entry 3). That 
is, unlike the reactions involving aldehydes, monosubstituted allenes 
fail to compete with allylic phosphates when bis-phosphines serve as 
ligands. These observations substantiated our initial concerns regard- 
ing the presence of two types of electrophilic olefin. 

We then made the unexpected discovery that, in further contrast to 
carbonyl additions, an NHC-Cu complex can guide the catalytic cycle 
along the desired path (Table 1). The NHC-Cu species derived from 
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Table 1 | Examination of copper complexes as catalysts for sequential Cu-B(pin) addition/EAS 


The representative process: 


TBSO 


—" 
5.0 mol% ligand, 
TBSO la 5.0 mol% CuCl 
(1.2 equiv.) ———S 
1.5 equiv. KOt-Bu, TBSO 
PhO opocet, —_1.2 equiv. Ba(pin) 2, 


2a THF, 22°C, 18h 


Reacts readily in the 
absence of allene 
(see text) 


The phosphine ligands and NHC precursors: 


« addition, Sj2’ (branched) 
trisubstituted alkene, chiral 


TBSO Bepin) 
S S ~ 


(pin)B Ph Ph 

3a 4 
y addition, Sy2’ (branched) 
disubstituted alkene, chiral 


TBSO Bipin) 
Ph 


S Ph 
B(pin) 


a addition, Syy2 (linear) 
trisubstituted alkene, achiral 


5 


6 


y addition, S,y2 (linear) 
disubstituted alkene, chiral 


CO é “cl = Ph ph BF a . “Cl Phy eh 
SD Ge Ses WO acu O 
ig fe) .sPPh2 MesN.,NMes MesN. NMes 
CO 7 < 8 9a 9b 16 eo 10 1 
PFs PF, “PF. Et -pr, i-Pr PFe 
i ae a ea i ae ae © paces o 
‘OH ‘OH ‘OH OH Et ‘OH i-Pr 
12a 12b 12¢ 12d 12e 
Entry number Ligand or ligand precursor Conversion (%)*; yield of 3a (%)+ Site selectivity (3a:4:5:6)* Z:E* Enantiomeric ratio for 3at 

1 PCy3 <2; NA NA NA NA 
2 7 <2; NA NA NA NA 
3 8 >98; <2 NA NA NA 
4 9a <98; 81 SO8:<2:<2:<2 >98:2 NA 
5 9b 40:<2 NA NA NA 
6 10 >98; 36 >98:<2:<2:<2 >98:2 22:78 (R:S) 
7 11 >98; <2 NA NA NA 
8 12a >98; 67 298:<2:<2:<2 >98:2 89:11 (R:S) 
9 12b >98; 74 >98:<2:<2:<2 >98:2 93:7 (RS) 
10 12c >98; 72 2>98:<2:<2:<2 >98:2 85:15 (R:S) 
1 12d >98; 80 >98:<2:<2:<2 >98:2 94:6 (R:S) 
12 12e >98; 77 >98:<2:<2:<2 >98:2 92:8 (R:S) 


Reactions were carried out under Nz atmosphere; see Supplementary Information for details. NA, not applicable; Mes, 2,4,6-Me3-CgHo. 
* Conversion, site selectivity and Z:E ratios were determined by analysis of 400 MHz 'H NMR spectra; variance of values is estimated to be <+2%. 


+ Yield of purified products. 


{ Enantiomeric ratio values were determined by HPLC analysis; variance of values is estimated to be <+1%. See Supplementary Information for details. 


aryl-substituted heterocyclic salt 9a (entry 4) afforded 3a as the major 
component (81% yield); the alternative alkenylboron-containing pro- 
ducts 4, 5 or 6 were not detected (<2%); the complete site (branch) selec- 
tivity was equally surprising. Conversely, with enantiomerically pure 9b 
(entry 5), 3a was isolated in trace amounts, and reactions involving chiral 
salts 10°° and 11” either produced 3a in low yield (compare phenol 10, 
entry 6) or none atall (compare sulphonate 11, entry 7). Investigation of 
enantiomerically pure NHC precursors bearing an N-aryl and an N-alkyl 
group led us to establish that reaction with aminoalcohol-derived 12a” 
(entry 8, Table 1) affords 3a in 67% yield, >98% S\2’ selectivity and 89:11 
e.r. Follow-up studies revealed that enantioselectivity can be sensitive 
to the substituent at the stereogenic centre of the chiral catalyst (entries 
8-10, Table 1). Additional modification revealed that 12d is precursor to 
amore efficient (80% yield; entry 11) and enantioselective catalyst (94:6 
e.r.). Incorporation of larger N-aryl substituents did not lead to any 
improvement (compare 12e, entry 12). 

The method can be used to prepare a range of multifunctional organ- 
oboron compounds in high selectivity (Fig. 2a). The requisite imidazoli- 
nium salt 12d, an air-stable solid, can be prepared in multigram quantities 
bya modified version of a reported procedure”; the necessary reagents, 
including either enantiomeric form of phenylglycinol, can be bought 
at low cost. Allylic phosphates bearing sterically hindered substituents 
(3b, c), halogenated aryl groups (3d, e) or an alkyl unit (13) are suitable 
substrates. Although NHC-Cu-B(pin) complexes react readily with 
B-alkylstyrenes”, additions to an allene/EAS occur more readily (14). 
Allenes that contain other modifiable groups, such as an alkyne (15), an 
amine (16), or an amide (17 or 19) may be used. As the outcomes of the 
transformations expected to generate amides 17-19 indicate, a Lewis basic 
group, depending on its distance from the allene site, can alter reaction 
rates. Unsubstituted allene was used to access 1,1-disubstituted alkenyl- 
B(pin) 20 in 89% yield, >98% branch selectivity and 97:3 e.r. Catalytic 


cross-coupling reactions with readily accessible aryl halides proceed with 
retention of stereochemistry to generate trisubstituted alkenes (21-23, 
Fig. 2b). 


Origins of high efficiency and selectivity 

The challenge of designing a multicomponent process is in identifying 
a catalyst that can clear several efficiency and selectivity hurdles before 
reaching the finish line and starting again. Key attributes of the optimal 
NHC-Cu complex (derived from 12d) are discussed below. 


Chemoselectivity and efficiency 
The difference between percentage conversion and yield of 3a with cer- 
tain Cu complexes (Table 1) signals a breakdown in chemoselectivity: 
competitive Cu-B addition to 2a (versus allene 1a) leads to formation 
of by-products. Hence, it appears that the less Lewis basic and sterically 
demanding phosphine-based systems (for example, 8, Table 1), which 
are distinct from those of NHC ligands**”’, allow the phosphate to asso- 
ciate and react more readily. The pathway that hampers the transforma- 
tions of the less effective NHC-Cu complexes is more tractable. Allylic 
substitution of a B(pin) group with 2a produces a branched allylboron 
intermediate (24, Fig. 3a) that can be converted to the corresponding allyl- 
copper species (25), which then reacts with another molecule of allylic 
phosphate 2a to form 1,5-diene 26. Indeed, without the allene, the NHC- 
Cu complexes catalyse the formation of diene 26 efficiently (for example, 
53% yield for 9a, 76% yield for 11 and 50% yield for 12d). Similar gen- 
eration of an allylcopper might be inefficient with the Cu centre ofa bis- 
phosphine complex, which is probably less Lewis acidic as a result of its 
weaker two-electron donor ligand (versus an NHC)”, causing the allyl- 
boronate compound to react in other ways. 

Comparison of the transformations performed with NHC-Cu com- 
plexes derived from 9c-f (Fig. 3b) indicates that the proper balance 
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a Anefficient, practical and selective multicomponent NHC-Cu-catalysed process 
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at 22 °C, 18 h: at 22°C, 18 h: 


>98% conv., 83% yield, <2% conv. of 2a 


>98% branched, >98% Z, 94:6 e.r. 


>98% conv., 68% yield, 
>98% branched, >98% Z, 94:6 e.r. 


>98% conv., 75% yield, 
>98% branched, >98% Z, 92:8 e.r. 


A Bn2N 
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15 16 
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>98% conv., 70% yield, 
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>98% conv., 73% yield, 
>98% branched, >98% Z, 99:1 e.r. 
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19 20 
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87% conv., 74% yield, 
>98% branched, >98% Z, 95:5 e.r. 


>98% conv., 89% yield, 
>98% branched, 97:3 e.r. 


b Representative catalytic cross-coupling functionalization of the trisubstituted alkenyl-B(pin) products 


TBSO 
2.0 mol% Pd,(dba)3, 


Sess 4.0 mol% S-Phos 
(pin)B En 1.5 equiv. 2-bromopyridine, 
3a THF:3.0 M NaOH (3:1), 


60°C, 18h 


With >98% retention of stereochemistry 


Figure 2 | Catalytic chemo-, site- and enantioselective multicomponent 
reactions. a, Transformations are promoted by NHC-Cu complexes generated 
in situ from 12d, which can be easily prepared from inexpensive starting 
materials on a multigram scale in ~50% overall yield. Transformations proceed 
with 5.0 mol% catalyst at 4-22 °C and are complete in 18-24h to deliver the 
desired products in >98% Z, Sy2' and chemoselectivity and 92:8 to >99:1 


between electronic attributes and size of the heterocyclic ligand is needed 
if high efficiency and chemoselectivity are to be achieved. The catalyst aris- 
ing from 9d is too large to promote transformation, whereas the ligands 
derived from the smaller 9c and 9f contain N-alkyl units (versus N-aryl) 
and are therefore too nucleophilic to facilitate the desired succession of 
events. Imidazolinium salt 9e delivers an NHC-Cu complex that is small 
enough to promote reaction without being too diminutive or overly nucle- 
ophilic to promote Cu-B(pin) addition to the allylic phosphate. The 
complex resulting from 10 (>98% conversion, 36% yield of 3a; entry 6, 
Table 1) in all probability serves as a monodentate ligand that contains 
an N-mesityl and a smaller ortho-substituted N-aryl moiety, rendering 
it less selective (see below). The bidentate Cu catalyst arising from 11 is 
probably the only instance of a bidentate complex formed in the screen- 
ing studies (detailed below); the cuprate species possesses higher-energy 
Cu d-electrons that are more suitable for interacting with the lower- 
lying n* orbital of an allylic phosphate (versus an allene)”’, facilitating 
the undesired allylic substitution of a B(pin) unit (low chemoselectivity). 
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77% yield, >98% E 


TBSO TBSO 


22 23 
74% yield, >98% E 83% yield, >98% E 
e.r. b, The trisubstituted alkenyl-B(pin) obtained with complete Z selectivity 
can be converted to a variety of trisubstituted E alkenes through catalytic 
cross-coupling with readily available aryl bromides; all reactions proceed 
with complete retention of stereochemistry. Conv., conversion; TBS, 
t-butyldimethylsilyl; dba, dibenzylideneacetone; S-Phos, 
2-dicyclohexylphosphino-2’ ,6’-dimethoxybiphenyl. 


Branch- and enantioselectivity 

Chiral heterocyclic ligands (for example, 10, 11, 12a-e) with a chelating 
group commonly serve as precursors to bidentate NHC-Cu systems (that 
is, cuprate complexes). However, the resulting Cu—O tether can rupture 
through reaction with B2(pin)., revealing a monodentate complex that 
carries a neutral metal centre**. For two reasons we did not initially think 
that such a cleavage would take place with Cu complexes resulting from 
12a-e. First, exceptional Sy2' selectivity is usually observed with reactions 
of organoboron compounds that are promoted by bidentate NHC-Cu 
catalysts**”°; this preference may be attributed to rapid reductive elim- 
ination of the Cu(t) intermediate* so that substantial steric hindrance 
can be relieved (compare the top pathway available to IT in Fig. 3d). The 
less sterically demanding monodentate complexes, on the other hand, 
generate achiral linear isomers either preferentially** or to a significant 
degree**. Second, high enantioselectivities have been observed with catalysts 
that contain a chiral NHC ligand that is either bidentate (for example, 11 
in Table 1)’*, or monodentate (for example, 9b) but with conformationally 


©2014 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


a The major competitive pathway in NHC-Cu-catalysed reactions 
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Figure 3 | Origins of high efficiency and selectivity. a, Low efficiency with 
some NHC-Cu-catalysed reactions is due to a competitive pathway arising 
from undesirable chemoselectivity. b, The efficiency of the multicomponent 
process hinges on the catalyst possessing the appropriate steric and electronic 
attributes. c, Modification of the chiral NHC ligand indicates that the optimal 
Cu-based catalyst is likely to be a monodentate complex with an N-alkyl side 
chain containing the sole stereogenic centre. d, DFT calculations point to a 


constraining stereogenic centres**”’, or both”***”*. High enantioselec- 


tivity without bidentate ligation and/or conformationally restricting 
substituents is unusual, since it must originate from a single stereogenic 
centre within the conformationally flexible arm ofa C\-symmetric NHC 
ligand. Nevertheless, such a scenario became irrefutable when we found 
that reaction with silyl ether 12f (Fig. 3c) proceeds with nearly identical 
efficiency and selectivity as when 12d is used. (We were unable to pre- 
pare and examine an authentic sample of the boronate derivative; similar 
results were obtained with the corresponding tert-butyldiphenylsilyl ether 
analogue of 12f.) Additionally, with methyl ether 12g (Fig. 3c), enantio- 
selectivity was all but completely eroded. Stereoselectivity is thus likely 
to be induced by the large B(pin)-substituted chiral appendage, formed 
through reaction of B,(pin), with the Cu-O bond and emulated by the 
silyl ether in 12f. 

Calculations through the use of density functional theory (DFT) point 
to transition structure I as the source of the major product enantiomer 
(Fig. 3d; see Supplementary Information for details of all calculations). 


>98% conv., 69% yield, 
>98% 3a, >98% Sn2’, 51:49 er. 


>98% conv., 53% yield, 
>98% 3a, >98% Sy2’, 51:49 e.r. 
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mode of transformation (I) leading to the major enantiomer (versus III). 
The uniformly exceptional branch or Sy2’ selectivity (versus linear or Sy2), 
despite the involvement of a neutral monodentate NHC-Cu catalyst, might be 
due to steric facilitation of the reductive elimination step (versus 7-allyl 
formation) via II and IV. e, Evidence for the importance of Cu-phosphate 
chelation to reaction efficiency. Dipp, 2,6-(i-Pr)2CsH3; TBS, (t-Bu)Me.Si. 


The allylic phosphate occupies two sites of the tetrahedral Cu(1) com- 
plex to generate a square planar Cu(111) species that undergoes reductive 
elimination via II to give viii (versus ix). The P=O—Cu coordination 
facilitates the association of the allylic phosphate with the sterically 
demanding NHC-Cu-allyl complex; this picture is supported by the var- 
iations in reaction efficiency observed for the transformations involving 
products 17-19 (Fig. 2a). In the case of 18 (<2% conversion), the Lewis 
basic amide carbonyl is properly situated to chelate with the Cu centre 
in the first intermediate to prevent phosphate chelation (x, Fig. 3e). The 
ring size in the bidentate complex xi is similar to that found in the oxi- 
dative addition precursor V (Fig. 3e). 

The two-point catalyst/substrate binding enhances the organization 
of the stereochemistry-determining transition state, generating high ste- 
reochemical induction via II. The minor isomer is probably produced 
via III, wherein the sizeable boronate group can swerve into close con- 
tact with the protruding allylic phosphate substituent. The B(pin) moiety 
of the allyl ligand must either collide with the ethyl substituents of the 
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NHC’s N-aryl moiety (shown) or induce steric repulsion due to prox- 
imity of the B(pin) unit and the NHC side chain. There must therefore 
be a feature of the catalyst structure that is responsible for C-C bond 
formation occurring near the chiral arm of the NHC ligand. Molecular 
models suggest that, because of steric factors, the ortho (ethyl) substi- 
tuents of the ligand’s N-aryl moiety discourage placement of the allyl 
fragments in their vicinity. The complete loss of enantioselectivity that 
results from placement of the groups at the N-aryl moiety’s C3 and C5 
positions corroborates the proposed scenario (that is, the derived boro- 
nate of NHC precursor 12h, Fig. 3c). 


Exceptional Sy2’:Sy2 ratios 

Then there are the exceptional Sy2':S\2 ratios despite involvement of 
a monodentate-Cu complex**”*. This almost certainly originates from 
the sizeable B(pin) unit of the allyl ligand of the Cu(1m) complex; the boro- 
nate moiety is absent in the formerly examined EAS reactions with organ- 
oboron compounds**”». The steric repulsion engendered by the sizeable 
B(pin) group elevates the ground state energy of the Cu(m) intermediate 
species II (major) and IV (minor), accelerating reductive elimination 
(to give viii in Fig. 3d) before it can collapse to the r-allyl species (ix in 
Fig. 3d). DFT calculations support the contention that the presence of 
the large B(pin) group lowers the activation barrier to reductive elim- 
ination (strain release). 


Gram-scale total synthesis of rottnestol 


Synthesis ofa complex organic molecule with catalytic multicomponent 
processes as its central feature would be a clear indicator of the utility of 


a Enantioselective synthesis of the polyene segment of rottnestol 
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b Enantioselective synthesis of the carbohydrate segment of rottnestol and 
ma 
M 2S 
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such processes, particularly if meaningful quantities ofa target molecule 
were to be secured. We first designed a route to prepare gram quantities 
of pure rottnestol where every issue of stereochemical control would be 
addressed bya catalytic transformation. We envisioned using the NHC- 
Cu-catalysed enantioselective process involving an allylic phosphate for 
synthesis of the polyene segment, while the carbohydrate moiety would 
be accessed through a catalytic B2(pin),/allene/aldehyde fusion (Fig. 4). 
The synthesis route commenced with the Cu-B(pin) addition/EAS 
sequence (Fig. 4a). Treatment of monosubstituted allene 1b and methyl- 
substituted allylic phosphate 2b with 3.0 mol% S-12d and CuCl (Fig. 2) 
afforded trisubstituted alkenyl-B(pin) 27 in 79% yield, with complete 
branch and Z selectivity and in 92:8 e.r.; the reaction was performed on 
two batches of ~1.7 g of 1b, delivering a total of ~4.2 g of the product. 
Conversion of the C-B(pin) to a C-Me bond was accomplished with 
complete inversion of stereochemistry (>98% E) through reaction with 
methyllithium and iodine”, delivering trisubstituted alkene 28 in 91% 
yield (3.4 g). NHC-Ru-catalysed E-selective cross-metathesis with com- 
mercially available vinyl-B(pin)’* in the presence of 5.0 mol% Ru carbene 
29% and formation of the corresponding alkenyl-iodide’* proceeded in 
80% overall yield, furnishing ~3.6 g of 30, which was transformed to 
1.4g of triene 31 in three straightforward steps (73% overall yield). 
To prepare the carbohydrate fragment (Fig. 4b), we adopted an enan- 
tioselective reaction involving aldehyde 32, which can be prepared in 
one step from a commercially available alcohol and four other entities 
that can also be purchased: methylallene 1c, B2(pin)2, bis-phosphine 
33 and CuCl. Silyl protection of the B-hydroxyketone afforded 34 in 
75% overall yield (3.4 g through two batches) with complete control of 


With >98% inversion of stereochemistry 
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Figure 4 | Enantioselective gram-scale synthesis of rottnestol. Every 
stereochemical issue in the route is addressed by a catalytic process that involves 
an organoboron compound; this is highlighted by two multicomponent 
chemo-, site-, diastereo- and enantioselective assemblies. a, Site- and 
enantioselective NHC-Cu-catalysed B,(pin),/allene/allylic phosphate and 
NHC-Ru-catalysed catalytic cross-metathesis (CM) reactions are combined to 
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access the acyclic fragment. b, A phosphine-Cu-catalysed multicomponent 
process involving an allene and an aldehyde is used to access the carbohydrate 
moiety. The final fragment coupling is achieved by phosphine-Pd-catalysed 
coupling, generating nearly 1.2 g of the natural product. DMAP, 
4-dimethylaminopyridine; DMPU, N,N’-dimethyl-N,N’ -trimethyleneurea; 
dppf, 1,1'-bis(diphenylphosphino) ferrocene. Ad, adamantyl. 
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Figure 5 | Enantioselective gram-scale synthesis of herboxidiene. The key 
step, taking place at mid-point in the multistep route, involves a relatively 
complex enantiomerically pure monosubstituted allene (37). The reaction can 
be performed on gram-scale batches to obtain ~1.7 g of 1,5-diene 38 for each 
run (~76% yield), with >98% site-, Z- and diastereoselectivity. Subsequent 
conversion to E,E-diene 39 proceeds with complete stereochemical control as 


stereoselectivity (>98:2 d.r. and e.r.). Ketone 34 was converted to carbohy- 
drate 35 after four steps in 67% overall yield (1.18 g). NHC-Cu-catalysed 
protoboration of the terminal alkyne in 35 furnished B-alkenyl-B(pin) 
36 in 97:3 B:o ratio, >98% E selectivity and 93% yield (1.8 g)*. More than 
one gram of stereoisomerically pure rottnestol was obtained after cata- 
lytic cross-coupling” of alkenyl-iodide 31 and alkenyl-B(pin) 36 followed 
by generation of the cyclic hemiacetal by acid treatment. The route in 
Fig. 4is more efficient than those reported previously (21.5% versus 3.7% 
overall yield)” and which resulted in no more than milligram quantities 
of the target molecule. 


Gram_-scale total synthesis of herboxidiene 


Devising a route leading to anti-tumour agent herboxidiene was next. 
Here, we explored a different aspect of the NHC-Cu-catalysed transfor- 
mation (Fig. 5). In the case of rottnestol, the multicomponent process 
was employed early on (27, Fig. 4); in contrast, with herboxidiene, the 
process would be implemented at a later stage with a more complex allene. 
In the event, ~7 g of substrate 37 were obtained by a seven-step procedure 
in 29% overall yield, >98:2 d.r. and e.r.; the central reaction in the sequence 
was phosphine-Cu-catalysed multicomponent reaction of B2(pin)2, meth- 
ylallene 1c and an aldehyde derived from (R)-methyl lactate (compare 
synthesis of 34). Considerable structural complexity, including the appro- 
priate 1,5-relative stereochemistry, was thus generated in short order: 
1,5-diene 38 (~ 1.7 g for each run) was obtained in ~76% yield with com- 
plete site-, Z- and diastereoselectivity. Trisubstituted olefin 39 was accessed 
through alkylation and catalytic cross-metathesis with vinyl-B(pin), yield- 
ing ~3.3 g of the desired product; both alkenes were formed with >98% E 
selectivity. Phosphine—-Pd-catalysed cross-coupling of alkenyl-B(pin) 39 
with alkenyl-iodide 40, synthesized through a diastereo- and enantiose- 
lective eight-step process starting from -(+ )-citronellene (see Supplemen- 
tary Information), afforded 2.55 g of triene 41. After three operations**, 
1.03 g of the anti-tumour agent was secured; this represents an overall 
yield nearly twice that of the most concise of the previously reported 
syntheses* (5.5% versus 3.4%; see Supplementary Information for 
bibliography). 


Conclusions and discussions 


The advances outlined here demonstrate that two simple unsaturated 
organic molecules and a commercially available diboron reagent can 
be combined to generate multifunctional alkenylboron fragments that 
are marked by several advantageous attributes. The requisite catalyst 
is assembled in situ by the reaction of abundant and inexpensive CuCl 
and a chiral ligand that is synthesized in multigram quantities readily 


well. Catalytic cross-coupling generates triene 41, which is then transformed to 
1.03 g of the natural product. It is noteworthy that every transformation shown 
above that involves a stereochemical issue proceeds with complete selectivity 
(that is, catalytic multicomponent process, alkylation of the alkenylboron 
intermediate, catalytic cross-metathesis, catalytic cross-coupling and directed 
epoxidation). acac, acetylacetone. 


and cost-effectively. Owing to the above features, and because the NHC- 
Cu-catalysed process is robust, gram quantities of a variety of complex 
organic molecules become reliably available. 

This advance foreshadows the development of protocols involving 
additional difficult-to-access alkenylboron-containing organocopper 
compounds. What emerges is the possibility of using other abundantly 
available poly-unsaturated hydrocarbons, such as dienes** or enynes*”, 
for efficient preparation of high-value products. Such a strategy obviates 
the need for succumbing to one-at-a-time installation of each functional 
unit, resulting in pathways that are unnecessarily time consuming, costly 
and waste generating. 
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Cichlid fishes are famous for large, diverse and replicated adaptive radiations in the Great Lakes of East Africa. To under- 
stand the molecular mechanisms underlying cichlid phenotypic diversity, we sequenced the genomes and transcriptomes 
of five lineages of African cichlids: the Nile tilapia (Oreochromis niloticus), an ancestral lineage with low diversity; and four 
members of the East African lineage: Neolamprologus brichardi/pulcher (older radiation, Lake Tanganyika), Metriaclima 
zebra (recent radiation, Lake Malawi), Pundamilianyererei (very recent radiation, Lake Victoria), and Astatotilapia burtoni 
(riverine species around Lake Tanganyika). We found an excess of gene duplications in the East African lineage compared 
to tilapia and other teleosts, an abundance of non-coding element divergence, accelerated coding sequence evolution, 
expression divergence associated with transposable element insertions, and regulation by novel microRNAs. In addition, 
we analysed sequence data from sixty individuals representing six closely related species from Lake Victoria, and show 
genome-wide diversifying selection on coding and regulatory variants, some of which were recruited from ancient poly- 
morphisms. We conclude that a number of molecular mechanisms shaped East African cichlid genomes, and that amass- 
ing of standing variation during periods of relaxed purifying selection may have been important in facilitating subsequent 
evolutionary diversification. 


Wide variation in the rates of diversification among lineages is a feature 
of evolution that has fascinated biologists since Darwin’”. With approx- 


shaped by cycles of population expansion, fragmentation and contrac- 
tion as lineages colonized lakes, diversified, collapsed when lakes dried 


imately 2,000 known species, hundreds of which coexist in individual 
African lakes, cichlid fish are amongst the most striking examples of 
adaptive radiation, the phenomenon whereby a single lineage diversifies 
into many ecologically varied species in a short span of time’ (Fig. 1). The 
largest radiations, which in Lakes Victoria, Malawi and Tanganyika, have 
generated between 250 (Tanganyika) and 500 (Malawi and Victoria) 
species per lake, took no more than 15,000 to 100,000 years for Victoria 
and less than 5 million years for Malawi’, but 10-12 million years for 
Lake Tanganyika®. The radiations in Lake Victoria and Malawi thus dis- 
play the highest sustained rates of speciation known to date in vertebrates’. 
The evolution of these lineages and their genomes has presumably been 


up, and re-colonized lakes, and by episodic adaptation to a multitude of 
ecological niches coupled with strong sexual selection. Genetic diversity 
within lake radiations has been influenced by admixture following mul- 
tiple colonization events and periodic infusions through hybridization*’. 

Cichlid phenotypic diversity encompasses variation in behaviour, body 
shape, coloration and ecological specialization. The frequent occurrence 
of convergent evolution of similar ecotypes (Fig. 1) suggests a primary 
role of natural selection in shaping cichlid phenotypic diversity'®"’. In 
addition, the importance of sexual selection is demonstrated by a pro- 
fusion of exaggerated sexually dimorphic traits like male nuptial colour 
and elaborate bower building by males’. Ecological and sexual selection 
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Figure 1 | The adaptive radiation of African cichlid fish. Top left, map of 
Africa showing lakes in which cichlid fish have radiated. Right, the five sequenced 
species: Pundamilia nyererei (endemic of Lake Victoria); Neolamprologus 
brichardi (endemic of Lake Tanganyika); Metriaclima zebra (endemic of Lake 
Malawi); Oreochromis niloticus (from rivers across northern Africa); 
Astatotilapia burtoni (from rivers connected to Lake Tanganyika). Major 
ecotypes are shown from each lake: a, pelagic zooplanktivore; b, rock-dwelling 
algae scraper; c, paedophage (absent from Lake Tanganyika); d, scale eater; e, snail 


converge in the cichlid visual system, where trichromatic colour vision, 
eight different opsin genes and novel spherical lenses promote sensitivity 
in the highly dimensional visual world of clear-water lakes'?-*. Rapidly 
evolving sex determination systems, often linked to male and female 
colour patterns, may also speed cichlid diversification'’*’*. Ecological, 
social and behavioural variation correlates with striking diversity in brain 
structures’”” that appears early in development’*. 

Exceptional phenotypic variation, even among closely related species, 
makes cichlids different from most other fish groups, including those that 
share the same habitats with them but have not diversified as much, as 
well as those that have radiated into much smaller species flocks in north- 
ern temperate lakes’. However, how cichlids evolve in this exceptionally 
highly dimensional phenotype space remains unexplained. 

Wesequenced the genomes of five representative cichlid species from 
throughout the East African haplo-tilapiine lineage (Extended Data Fig 1a), 
which gave rise to all East African cichlid radiations. These five lineages 
diverged primarily through geographical isolation, and three of them 
subsequently underwent adaptive radiations in the three largest lakes 
of Africa (Fig. 1). Here we describe the comparative analyses of the five 
genomes coupled with an analysis of the genetic basis of species diver- 
gence in the Lake Victoria species flock to examine the genomic sub- 
strate for rapid evolutionary diversification. 


Accelerated gene evolution 


To assess whether accelerated sequence evolution was a general feature 
of East African cichlids, we annotated the genomes of all five cichlids 
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crusher; f, reef-dwelling planktivore; g, lobe-lipped insect eater; h, pelagic 
piscivore; i, ancestral river-dweller also found in lakes (absent from Lake 
Tanganyika). Bottom left, phylogenetic tree illustrating relationships between the 
five sequenced species (red), major adaptive radiations and major river lineages. 
The tree is from ref. 4, pruned to the major lineages. Upper timescale (4), 
lower timescale (32). Photos by Ad Konings (Tanganyika a, b, d, e, g, h; Malawi 
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(Tanganyika f), Oliver Selz (Victoria f, A. burtoni), Marcel Haesler (O. niloticus). 


(Extended Data Fig. 1a) and estimated the nonsynonymous/synonym- 
ous nucleotide substitution (dN/dS) ratio by sampling the concatenated 
alignments of all genes annotated with particular gene ontology (GO) 
terms. An elevated rate of nonsynonymous nucleotide substitutions can 
indicate accelerated evolution (either due to relaxed constraint or posi- 
tive selection); this approach has been applied previously in the context 
of cichlid vision’’ and morphology*”". We obtained significantly higher 
dN/dS ranks in O. niloticus (89 terms) compared to stickleback (11 terms), 
but considerably higher ranks still in the lineages of the East African 
radiation, haplochromines (299 terms) and N. brichardi (254 terms), 
(Extended Data Fig. 1b). In general, terms involved in morphological 
and developmental processes ranked significantly higher in haplochro- 
mines than in O. niloticus (P value = 0.036, Mann-Whitney U-test). 

Amongst protein-coding genes with an increased number of non- 
synonymous variants in haplochromines compared to N. brichardi and 
O. niloticus, two developmental genes, nog2 and bmpr1b, emerged show- 
ing haplochromine-specific substitutions. This result is notable given 
that three genes, a ligand (bmp4)”’, a receptor (bmpr1b) and an antag- 
onist (nog2) in the BMP pathway, all known to influence cichlid jaw mor- 
phology, show accelerated rates of protein evolution in haplochromine 
cichlids. 

Of 22 candidate genes previously identified in teleost morphogenesis, 
vision and pigmentation, three are predicted to have undergone accel- 
erated evolution in the common ancestors of the East African radiations 
suggesting a role in the diversification of cichlids: endothelin receptor 
type B1 (ednrb1) affects colour patterning” and perhaps pharyngeal jaw 
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development (Extended Data Fig. 2); green-sensitive opsin (kfh-g) and 
Rhodopsin (rho) are proteins important in vision. 


Gene duplication 

Gene duplication allows for subsequent divergent evolution of the resul- 
tant gene copies, enabling functional innovation of the proteins and/or 
expression patterns’. East African cichlids, including Oreochromis nilo- 
ticus, possess an unexpectedly large number of gene duplicates. We find 
280 duplications in the lineage leading to the common ancestor of the 
lake radiations and 148 events in the common ancestor of the haplochro- 
mines. When normalizing for branch lengths this corresponds to an 
approximately 4.5- to 6-fold increase in gene duplications that occurred 
in the common ancestor of the East African lake radiations relative to 
older clades, and an even higher duplication rate in the common ancestor 
of just the haplochromines (Fig. 2, Extended Data Fig. 3a-c). 

Inferred duplication rates in ancestral populations exceeded those 
in the extant taxa (Fig. 2). This could reflect the technical challenge of 
separating young, near-identical gene paralogues or true reduced rates 
in each lake radiation. Additionally, we could be underestimating lineage- 
specific rates of duplication owing to the sampling ofa single species per 
radiation, if duplications accumulate during speciation but only some 
become fixed. 

Cichlid-specific gene duplicates do not show statistically significant 
enrichment for particular gene categories (Supplementary Information). 
Expansion of the olfactory receptor gene family, which is a frequent fea- 
ture of vertebrate evolution”, was also seen in O. niloticus, but not in 
any of the lake cichlids (Extended Data Fig. 4; Supplementary Informa- 
tion). Retained duplicated genes are known to often diverge in function 
through neo- or subfunctionalization’’, and this has been suggested as 
part of the reason why bony fish generally are so species-rich (more than 
50% of all known species of vertebrates are fish). Moreover, differential 
retention of alternative copies of duplicated genes through the process 
of divergent resolution has been suggested to promote speciation rates 
directly”®. 

Differences in the expression patterns of duplicate genes may con- 
tribute to evolutionary divergence of species. The expression patterns 
of 888 duplicate gene pairs from the common ancestor of the East Africa 
cichlids were categorized according to whether they are expressed widely 
among tissues (52.8%), are similarly restricted in their expression pat- 
terns for both gene copies (26.6%), or, in at least one gene copy, have 
newly gained expression in one or more tissues (20.6%). 7.5% of dup- 
licates lost or gained complete tissue specificity, many (43%) of which 
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Figure 2 | Gene duplication in the ancestry of East African lake cichlids. 
Black numbers represents species divergence calculated as neutral genomic 
divergence between the sequenced species using ~2.7 million fourfold 
degenerate sites from the alignment of 9 teleost genomes. This neutral 
substitution model suggests ~2% pairwise divergence between the three 
haplochromines and a ~6% divergence to N. brichardi. Red numbers represent 
duplicated genes. Asterisks indicate excluded branches owing to incomplete 
lineage sorting in haplochromines or weak support of consensus species tree. 
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have gained specific expression in the testis. In each of the stomatin and 
RNF141 gene pairs, one gene copy is broadly expressed whereas expres- 
sion of the other is restricted to the testis (Extended Data Fig. 3d). RNF141 
is the zebrafish orthologue of the human ZNF230, a transcription factor 
suggested to have a role during spermatogenesis. This observation is 
particularly interesting in the context of strong sexual selection’ observed 
in many East African cichlids'*"*, including our sequenced species with 
the exception of N. brichardi. 


Transposable element insertions alter gene expression 


As in other teleosts, approximately 16-19% of the four East African 
cichlid genomes consist of transposable elements (TEs), and over 60% 
of cichlid TEs are DNA transposons (Extended Data Fig. 5; Supplemen- 
tary Information). Three waves of TE insertions were detected in each 
of the cichlid genomes (Extended Data Fig. 6a—f), including a cichlid- 
specific burst of the Tigger family*’. Notably, this TE family has contin- 
ued expanding in the youngest radiation, Lake Victoria (Extended Data 
Fig. 6a). 

Weanalysed the distribution of TE insertions near the 5’ untranslated 
region (5’ UTR; 0-20 kilobases upstream), or 3’ UTR (0-20 kb down- 
stream) of orthologous gene pairs. We find that genes with TE insertions 
near the 5’ UTRs are significantly associated with increased gene expres- 
sion in all tissues (false discovery rate (FDR) < 0.05, Mann-Whitney test, 
Extended Data Fig. 7a) compared to genes without TE insertions. In 
contrast, TE insertions near 3’ UTRs are significantly associated with 
increased gene expression in all tissues except brain and skeletal muscle 
(FDR < 0.05, Mann-Whitney U-test). 

Generally, when inserted within or near genes in the transcriptional 
sense orientation, TE insertions show the expected pattern of purify- 
ing selection. Such TEs often contain polyadenylation signals that result 
in transcriptional arrest’’. In all five cichlid species, intronic TE inser- 
tions occur preferentially in the antisense orientation of protein-coding 
genes, with the strongest bias being observed for long terminal repeats 
(LTRs) or long interspersed nucleotide repetitive elements (LINEs) (Ex- 
tended Data Fig. 7b). As expected, intronic DNA transposons and LINEs 
or LTRs present in intergenic regions fail to show a significant orienta- 
tion bias, and short interspersed nucleotide repetitive elements (SINE) 
show a moderate bias for sense insertions (Extended Data Fig. 7c). 

Surprisingly, none of the five cichlid genomes showed any deficit of 
sense-oriented LINE insertions with approximately 15% divergence, 
which correspond to a time of transposable element insertions in the 
common ancestor of the haplo-tilapiine cichlids (Extended Data Fig. 7d). 
This suggests that ancestral East African cichlids went through an extended 
period of relaxed purifying selection during which overall TE activity 
increased (Extended Data Fig. 6a—f). However, in more recent history, 
haplochromine cichlids showed an increased efficiency in purging poten- 
tially deleterious TE insertions (Extended Data Fig. 7d). 


Divergence of regulatory elements 


To identify potential regulatory sequences that have diverged among the 
East African cichlids, we first predicted conserved noncoding elements 
(CNEs)’** in Nile tilapia and eight other teleosts using a 9-way alignment 
of teleost genomes (zebrafish, Tetraodon, stickleback, medaka and the 
five cichlids; Supplementary Information). We then identified 13,053 
highly conserved noncoding elements (hCNEs) in tilapia and medaka. 
These are expected to be similarly conserved among the four East African 
lake cichlids as they shared a common ancestor with Nile tilapia more 
recently than with medaka. Among these hCNEs we searched for CNEs 
that exhibited significant changes (accelerated CNEs, aCNEs) (FDR- 
adjusted P < 0.05). A total of 625 such aCNEs (4.8%) were found to have 
diverged in one or more of the East African lake cichlids. Whereas the 
majority of aCNEs (93%) have experienced a higher rate of nucleotide 
substitutions, approximately a quarter have also experienced insertions 
(23%) and/or deletions (32%), again suggesting relaxed purifying selec- 
tion. The aCNEs are distributed in intergenic regions (70%), introns (28%) 
and UTRs (2%) of protein-coding genes (Supplementary information). 
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The largest number of aCNEs is found in N. brichardi(n = 214), with 
lower numbers found in A. burtoni (n = 140), P. nyererei(n = 129) and 
M. zebra (n = 142). Approximately 60% of the aCNEs (n = 370) are 
accelerated in only one lineage. The remaining aCNEs have either accu- 
mulated mutations independently in several lineages, or their acceler- 
ated evolution was initiated in a common ancestor. 

The majority of aCNEs in lake cichlids showed enrichment for nearby 
genes involved in ‘homophilic cell adhesion’ (P = 5.8 X 107 *) and ‘G- 
protein coupled receptor activity’ (P = 6.4 X 10~*). To verify the cis- 
regulatory function of these aCNEs, we assayed the ability of six selected 
aCNEs and their corresponding O. niloticus hCNEs to drive reporter 
gene expression in transgenic zebrafish. The assays not only indicated 
their potential to function as enhancers, but also demonstrated that 
aCNEs have altered the expression pattern compared to their homolo- 
gous hCNEs, indicating their potential for altering expression of their 
target genes in a tissue-specific manner. We illustrate this with an exam- 
ple in Extended Data Fig. 8 (additional examples in Extended Data Fig. 9). 


Novel microRNAs alter gene expression 


MiRNAs offer yet another effective way of altering gene expression pro- 
grams. We identified 1,344 miRNA loci (259-286 per cichlid species) 
from deep sequencing of small RNAs in late stage embryos (Extended 
Data Fig. 10a). By comparing these loci with known teleost microRNAs 
(Supplementary Information) we discovered: (1) 40 cases of de novo 
miRNA emergence and nine cases of apparent miRNA loss; (2) four 
distinct mature miRNAs with mutation(s) in the seed sequence; (3) at 
least 9 cases of arm switching”, (4) one case of seed shifting”’, and (5) 92 
distinct miRNAs with mutation(s) outside the seed sequence. 

We explored miRNA spatial expression patterns in one case of arm 
switching (t_mze-miR-7132a-5p and t_mze-miR-7132a-3p) and for four 
de novo miRNAs (Fig. 3 and Extended Data Fig. 10). In the case of arm 
switching, spatial expression of the miRNA is clearly differentiated between 
the two pairs, consistent with results described previously’. The spatial 
expression of the four de novo miRNAs (miR-10029, miR10032, miR- 
10044, miR- 10049) is confined to specific tissues (for example, fins, facial 
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Figure 3 | Novel cichlid microRNAs. a-f, Complementary expression of 
mir-10029 (b, d, f) and its predicted target gene bmpr1b (a, ¢, e) in stage 18 
(6 days post-fertilization) Metriaclima zebra embryos. c-f are 18-|1m sagittal 
sections. In c and d arrows point to expression (black) or lack of expression 
(white) in the somites, presumptive cerebellum, and optic tectum (from left to 
right). In e and f, arrows point to expression and lack of expression in the 
somites (dorsal) and the gut (ventral). In all panels, anterior is to the right. 
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skeleton, brain) and is strikingly complementary to genes predicted to 
contain target sites for these miRNAs (miR-10032 targets neurod2, and 
miR-10029 targets bmpr1b). The neurod2 gene is known to be involved 
in brain development and neural differentiation whereas bmpr1b, pre- 
viously described amongst the fast evolving genes, is implicated in the 
development and morphogenesis of nearly all organ systems. 


Extensive shared polymorphisms 


Owing to their relatively recent divergence time and the potential for 
gene flow between lakes***', we predicted widespread incomplete lin- 
eage sorting (ILS) among haplochromine cichlids. We found that nearly 
half (43%) of the nucleotides sequenced are incompletely sorted amongst 
the three haplochromines (Fig. 4a). Furthermore, assuming a constant 
mutation rate, and an A. burtoni-M. zebra-P. nyererei speciation event 
~10 million years ago (Myr ago) (ranging from 7 Myr ago to 15 Myr ago 
depending on whether Gondwana rifting dates are included or excluded 
from calibration’), we predict the subsequent speciation event between 
the lineages to which M. zebra and P. nyererei belong to about 8.5 Myr 
ago (Supplementary Information). The degree of ILS is highly variable 
across chromosomes. Compared to intergenic regions, coding regions 
were found to be slightly, yet significantly, depleted in ILS (43.5% vs 
41.0%, P < 0.001). Reduction of ILS in coding versus noncoding regions 
in allopatric lineages of haplochromine cichlids is less than that found 
in the similarly divergent primate trio, gorilla-chimpanzee-human (30% 
vs 22%)°*. This suggests that natural selection has been a more efficient 
force on primate genomes than on the allopatrically diverging genomes 
of the haplochromine cichlid lineages, with important implications for 
genetic diversity in the radiations to which these lineages gave rise. 


Lake Victoria, a recent evolutionary radiation 


Cichlid fish adaptive radiation is characterized by rapid speciation without 
geographical isolation. In Lake Victoria, several hundred endemic species 
emerged within the past 15,000-100,000 years**. We analysed patterns 
of genome-wide genetic variation in six sympatric and closely related 
species of the genera Pundamilia, Mbipia and Neochromis, all of which 
are endemic to Lake Victoria. We used the P. nyererei genome to inves- 
tigate the pattern and magnitude of genomic differentiation in pairwise 
species comparisons. We then further characterized the regions of geno- 
mic differentiation to learn about: (1) the genomic distribution of diver- 
gent sites putatively under selection; (2) their nature (coding vs regulatory); 
(3) whether diversification occurred by selection on old standing vari- 
ation, newer mutations or both. 


Divergent selection on many genes 


Analyses of restriction-site-associated DNA (RAD) data showed that 
the average genome-wide divergence was significant in all pairwise spe- 
cies comparisons (P < 0.001). In each pairwise comparison, we find many 
SNPs with high fixation index (Fsr) values distributed across all chro- 
mosomes (Fig. 4c). In each pair, 250 to 439 of these SNPs constitute 
significant outliers from the Fsry distribution (FDR < 5%; Fig. 4c), and 
BAYESCAN results indicate numerous loci under selection. Phyloge- 
netic trees reconstructed from the concatenated RAD sequence data 
resolve species with high bootstrap support”’, and loci putatively under 
selection play a strong role in differentiating species (Fig. 4b). Taken 
together, these results suggest that even the most recent rapid specia- 
tion in African lake cichlids is associated with genomically widespread 
divergence. Fixation of alternative alleles between species happens but 
is restricted to a minority of the many divergent loci, consistent with 
models of polygenic adaptation from standing genetic variation”’. 
Weused the annotated P. nyererei reference genome to identify genes 
that diverged during and soon after speciation for three sister species 
pairs and two pairs of more distant relatives (Fig. 4c). We annotated all 
SNPs according to their positions in exons and potential cis-regulatory 
elements (in introns and 25 kb either side of genes), and analysed the 
proportion of SNPs in each category over increasing Fey. In both pairs 
of sister species that differ primarily in male breeding coloration, the 
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Figure 4 | Genomic divergence stems from incomplete lineage sorting (ILS) 
and both old and novel coding and noncoding variation. a, Coalescence 
times and trees supporting ILS among the genomes of allopatric East African 
cichlid lineages were inferred by coalHMM. The most common genealogy 
matches the known species tree and represents a M. zebra-P. nyererei 
coalescence that falls between the two speciation times, T,,, (speciation 

M. zebra-P. nyererei) and T-,,, (speciation M. zebra-P. nyererei-A. burtoni). In 
genealogies 1 (dashed line), 2, and 3, all coalescence events are ancient and 
occur before time T-,,. b, Phylogenetic analysis of RAD-sequence data showing 
well-supported differentiation among young Victoria species. The complete 
data set (top) renders the genus Mbipia non-monophyletic, exclusion of the top 
1% divergent loci (bottom) supports monophyly of each genus. c, Genomic 
divergence in paired comparisons of Lake Victoria cichlids (per-site Fsr; 
black/grey are chromosomes). Sister species from top: Pundamilia 


proportion of SNPs in exons increases from <10% in the full set of SNPs, 
to >18% at highly divergent SNPs. In the species that have diverged pri- 
marily in morphology, we find no exonic variants among highly divergent 
SNPs, and an increasing proportion of SNPs in introns with increasing 
Fer (Fig. 4c). 

These data suggest contrasting genomic mechanisms underlying phe- 
notypic evolution depending on whether speciation is driven primarily 
by divergence of coloration and associated traits or by divergence of 
morphology associated with feeding ecology. This supports two predic- 
tions from evolutionary developmental biology”: (1) variation in coding 
sequence is most likely to be involved in the divergence of physiological 
and/or terminally differentiated traits like colour; (2) regulatory vari- 
ation is more important in morphological changes involving genes that 
have pleiotropic effects in developmental networks. 


nyererei/P. pundamilia and Mbipia lutea/M. mbipi differ in male breeding 
coloration but have conserved morphology; Neochromis omnicaeruleus/N. sp. 
“unicuspid scraper” and distant relatives P. pundamilia/M. mbipi and 

P. nyererei/M. lutea have similar coloration but differ in morphology. 
Red-highlighted SNPs indicate significantly divergent sites between colour- 
contrasting species, but not between same-colour species. Bar plots show the 
proportion of SNPs in four annotation categories: exons (orange), introns (dark 
blue), 25-kb flanking genes (turquoise), or none of the above (grey), for 
thresholds of increasing Fer. In “All sites” and “Ancient variant sites” analyses, 
symbols indicate an excess of SNPs in a given annotation category compared to 
expectations from the full data set or from all non-ancient variant sites, 
respectively (FDR q-values: *q < 0.05; tq = 0.05), (Supplementary 
Information, Data Portals, Supplementary Population Genomics FTP files). 


For the Pundamilia species pair, putative regulatory SNPs with Fey 
values significantly greater than zero show enrichment in conserved 
transcription factor binding sites and PhastCon elements (conserved 
elements across 46 vertebrate species), supporting a regulatory role for 
these variants. GO term enrichment analyses indicate that exonic SNPs 
are associated with metabolism and biosynthesis processes, while puta- 
tive regulatory SNPs are associated with terms related to morphogenesis 
and development. 

Comparing Fer for each SNP in all six pairwise comparisons of the 
Mbipia and Pundamilia species revealed 3 candidate regulatory SNPs 
on LG6, 7 and 22 that are highly divergent in all comparisons of species 
with different colours, but not significantly differentiated between spe- 
cies with similar colours (Fig. 4c). The SNP on LG7 falls within a known 
quantitative trait locus (QTL) interval for yellow versus blue colour (and 
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sex determination) in Malawi cichlids’. None of these SNPs are fixed 
differences between species, suggesting polygenic adaptation. 


Sorting of ancient polymorphisms 

To investigate whether ancient genetic variation, predating the origin 
of the Lake Victoria species flock, was an important source of alleles that 
are divergently sorted during speciation, for SNPs in each of the three 
Victoria sister species pair comparisons, we identified orthologous sites 
among the four other cichlid genomes. We find 14-15% of all Victoria 
SNPs are also variable among the other cichlid genomes. Among these 
‘ancient variants’, the proportion of SNPs in exons increases from 9-15% 
among all sites to 30-100% at highly divergent SNPs in both pairs of 
sister species that differ primarily in male breeding coloration (Fig. 4c). 
Among the ancient exonic variants that became fixed in the red/blue 
Pundamilia speciation event is srd5a2b, a teleost-specific duplicate of 
srd5a2 which, in mammals, converts testosterone to dihydrotestoster- 
one and has been implicated in sexual differentiation®’. In the blue sister 
species that have diverged primarily in morphology, two ancient variants 
in potential cis-regulatory regions are highly divergent despite incom- 
plete reproductive isolation among these incipient species” (Fig. 4b). 
We compared the proportions of putative ancient variants to all SNPs 
between annotation categories, and find evidence for higher proportions 
of ancient variants in gene-associated regions than in non-genic regions 
(likelihood ratio tests on 2 < 2 contingency tables; exons: Pundamilia 
P= 0.016, Neochromis P = 0.015; flanking regions: Pundamilia P = 0.020; 
all other P> 0.1). 

These analyses suggest that the genomic substrate for adaptive radi- 
ation includes ample coding and regulatory polymorphism, likely to 
be present well before the start of the radiations, some of which became 
subsequently sorted during species divergence. 


Conclusions 


In African lakes, nearly 1,500 new species of cichlid fish evolved in a 
few million years when environmentally determined opportunity for 
sexual selection and ecological niche expansion’ was met by an evolu- 
tionary lineage with unusual potential to adapt, speciate and diversify. 
Our analyses of five cichlid species representing five different lineages 
in the haplo-tilapiine clade, some of which gave rise to radiations, and 
of six closely related species from the most recent radiation, shed light 
into the complex genomic mechanisms that may give East African cich- 
lids their unusual propensity for diversification. 

We provide evidence for accumulation of genetic variation under 
relaxed constraint preceding radiation and involving multiple evolution- 
ary mechanisms, including accelerated evolution of regulatory and coding 
sequence, increased gene duplication, TE insertions, novel micoRNAs 
and retention of ancient polymorphisms, possibly including interspecific 
hybridization. In addition, our data on genomic divergence within the 
Lake Victoria species flock suggest that adaptive radiation within the lakes 
is associated with divergent selection on many regions in the genome, both 
coding and regulatory, often recruiting old alleles from standing variation. 

We conclude that neutral and adaptive processes both make impor- 
tant contributions to the genetic basis of cichlid radiations, but their 
roles are distinct and their relative importance has changed through time: 
neutral (and non-adaptive) processes seem to have been crucial to amass- 
ing genomic variation, whereas selection subsequently sorted some of 
this variation. The interaction of both is likely to have been necessary 
for generating many and diverse new species in very short periods of time. 
Online Content Methods, along with any additional Extended Data display items 


and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Extensive genomic characterization of human cancers presents the problem of inference from genomic abnormalities to 
cancer phenotypes. To address this problem, we analysed proteomes of colon and rectal tumours characterized previ- 
ously by The Cancer Genome Atlas (TCGA) and perform integrated proteogenomic analyses. Somatic variants displayed 
reduced protein abundance compared to germline variants. Messenger RNA transcript abundance did not reliably predict 
protein abundance differences between tumours. Proteomics identified five proteomic subtypes in the TCGA cohort, two 
of which overlapped with the TCGA ‘microsatellite instability /CpG island methylation phenotype’ transcriptomic subtype, 
but had distinct mutation, methylation and protein expression patterns associated with different clinical outcomes. Although 
copy number alterations showed strong cis- and trans-effects on mRNA abundance, relatively few of these extend to the 
protein level. Thus, proteomics data enabled prioritization of candidate driver genes. The chromosome 20q amplicon was 
associated with the largest global changes at both mRNA and protein levels; proteomics data highlighted potential 20q 
candidates, including HNF4A (hepatocyte nuclear factor 4, alpha), TOMM34 (translocase of outer mitochondrial membrane 
34) and SRC (SRC proto-oncogene, non-receptor tyrosine kinase). Integrated proteogenomic analysis provides functional 


context to interpret genomic abnormalities and affords a new paradigm for understanding cancer biology. 


TCGA has characterized the genomic features of human cancers'® 
and this has presented a new challenge of explaining how genomic al- 
terations drive cancers’. As proteins link genotypes to phenotypes, the 
Clinical Proteomic Tumour Analysis Consortium (CPTAC) is perform- 
ing proteomic analyses of TCGA tumour specimens for selected cancer 
types. Here we present the first integrated proteogenomic character- 
ization of human cancer with an analysis of the TCGA colorectal cancer 
(CRC) specimens’. 

The TCGA study affirmed well-established genomic features of CRC 
and described three transcriptional subtypes, 17 chromosomal regions 
of significant focal amplification and 28 regions of significant focal dele- 
tion, and linked genomic features of CRC to critical signalling pathways. 
The drivers underlying copy number alterations (CNAs) and transcrip- 
tional subtypes are largely unknown, and an integrative analysis of both 
genomic and proteomic data may provide a more comprehensive under- 
standing of the information flow from DNA to protein to phenotype. 


Peptide and protein identification 


We performed liquid chromatography-tandem mass spectrometry (LC- 
MS/MS)-based shotgun proteomic analyses on 95 TCGA tumour sam- 
ples (Extended Data Fig. 1 and Methods), the clinical and pathological 
characteristics and TCGA data sets for which are summarized in Sup- 
plementary Table 1. Benchmark quality control samples from one basal 
and one luminal human breast tumour xenograft were analysed in alter- 
nating order after every five CRC samples (Methods). 


We identified a total of 124,823 distinct peptides among the 95 sam- 
ples, corresponding to 6,299,756 spectra in an assembly of 7,526 protein 
groups with a protein-level false discovery rate (FDR) of 2.64% (Methods 
and Extended Data Fig. 2). To facilitate integration between genomic 
and proteomic data, a gene-level assembly of the peptides identified 
7,211 genes. 

A fundamental question in proteogenomics is which protein coding 
alterations are expressed at the protein level. As standard database search 
approaches cannot identify variant peptides from MS/MS data, we also 
performed database searches with customized sequence databases from 
matched RNA sequencing (RNA-seq) data for individual samples*” 
(Methods and Extended Data Fig. 3). 

We identified 796 single amino acid variants (SAAVs) across all 86 
tumours for which matched RNA-seq data were available (Fig. 1a, b and 
Supplementary Tables 2 and 3), among which 64 corresponded to somatic 
variants reported by TCGA and 101 were reported in the COSMIC data- 
base (that is, COSMIC-supported variants). Of the remaining SAAVs, 
526 were listed in the Single Nucleotide Polymorphism database (dbSNP) 
(that is, doSNP-supported variants) and are likely to be germline variants. 
The 162 previously unreported SAAVs might be explained by novel 
somatic or germline variants, RNA editing, or, in some cases, false discovery. 

The identified somatic variants were clearly enriched in the hyper- 
mutated samples, whereas the germline variants showed no association 
with hypermutation (Fig. 1a). Although 58% of the germline variants 
occurred in two or more samples, almost all somatic variants occurred 
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Figure 1 | Summary of detected single amino acid variants (SAAVs) and are shown in the Venn diagram. There are 162 SAAVs that have not been 


the impact of single nucleotide variants (SNVs) on protein abundance. 

a, The number of different types of SAAVs (TCGA-reported somatic variants, 
COSMIC-supported variants, dbSNP-supported variants and new variants) 
in individual tumour samples. The samples are ordered by the number of 
detected somatic variants, then COSMIC-supported variants, and then 
dbSNP-supported variants. The MSI and hypermutation (Hyper) status are 
labelled below the bar charts for each sample (red, MSI-high; orange, MSI-low; 
yellow, microsatellite stable; blue, hypermutated; light blue, non-hypermutated; 
grey, no data). The number of somatic variants and COSMIC-supported 
variants were significantly higher in MSI-high and hypermutated tumours, 
whereas the other two types of SAAVs were randomly distributed across the 
data set. b, The total numbers for different types of SAAVs and their 
overlapping relations. All 796 detected SAAVs were annotated based on 
previous reports in dbSNP (left circle), COSMIC (middle circle) or 
TCGA-reported somatic variants (right circle), and their overlapping relations 


in only one sample (Fig. 1c). The low identification rate for somatic 
variants may reflect relatively low sequence coverage in shotgun pro- 
teomics; however, somatic variants also might negatively impact pro- 
tein abundance, possibly by reducing translational efficiency or protein 
stability’. Using the protein abundance quantification method described 
below and detailed in the Methods, we found that somatic variants exerted 
a significantly stronger negative impact on protein abundance than did 
dbSNP-supported variants (P < 2.2 X 107°, Kolmogorov-Smirnov test; 
Fig. 1d and Methods). The percentage of variants with an impact score 
of less than —2 was doubled for somatic variants compared to dbSNP- 
supported variants (P< 2.2 X 107 '°, Chi-squared test; Fig. 1d). 
Cancer-related variant proteins may serve as candidate protein bio- 
markers or therapeutic targets. The 108 somatic or COSMIC-supported 
protein variants mapped to 105 genes, including known cancer genes 
in the Cancer Gene Census database such as KRAS, CTNNB1, SF3B1, 
ALDH2 and FH. The list also included 14 targets of FDA-approved 
drugs or drugs in clinical trials*, such as ALDH2, HSD17B4, PARP1, 
P4HB, TST, GAK, SLC25A24 and SUPT16H. A subset of variant peptide 
sequences, including KRAS(Gly12Asp) were verified by targeted ana- 
lyses of tumour lysates spiked with synthetic, isotope-labelled peptide 
standards (Methods). One example is shown in Extended Data Fig. 4. 


Quantification of protein abundance 


To quantify protein abundance, we used spectral counts, which are the 
total number of MS/MS spectra acquired for peptides from a given 
protein’ (Methods and Supplementary Table 4). Analysis of data from 
benchmark quality control samples demonstrated platform reproduci- 
bility throughout the analyses and enabled evaluation of data normali- 
zation methods (Extended Data Fig. 5a, b). Based on the minimal spectral 
count requirement established using the quality control data set (Extended 
Data Fig. 5c), 3,899 genes with a protein-level FDR of 0.43% were used 
to compare relative protein abundance across tumour samples. 


reported previously in these databases (new). c, Distribution of the frequency 
of occurrence for different types of SAAVs. Border colours of the pie charts 
correspond to different SAAV types using the same colour scheme as in 

a. Whereas 58% of dbSNP-supported variants occurred in two or more 
samples, almost all somatic variants occurred in only one sample each. d, SNVs 
detected in RNA-seq data were separated into three categories (dbSNP- 
supported, COSMIC-supported and TCGA-somatic). The impact of individual 
SNVs on protein abundance was calculated (see Methods) and the impact 
scores for different categories of SNVs were plotted as cumulative fraction 
curves with two-sided P values from the Kolmogorov-Smirnov test labelled. 
The percentage of SNVs with an absolute impact score greater than 2 was also 
plotted as an inset, with P values from the Chi-squared test. Sample size for the 
dbSNP-supported, COSMIC-supported and TCGA-somatic variants were 
1,2184, 7,492 and 3,302, respectively. 


mRNA versus protein abundance 

The matched proteomic and RNA-seq measurements from the TCGA 
CRC tumours enabled the first global analysis of transcript-protein 
relationships in a large human tumour cohort (Methods). First, we com- 
pared the steady state mRNA and protein abundance for each gene within 
individual samples (Methods and Extended Data Fig. 6a). All samples 
showed significant positive mRNA-protein correlation (multiple-test 
adjusted P < 0.01, Spearman’s correlation coefficient) and the average 
correlation between steady state mRNA and protein abundance in indi- 
vidual samples was 0.47 (Fig. 2a), which is comparable to previous reports 
in multi-cellular organisms’’. 

Next, we examined the concordance between mRNA and protein 
variation of individual genes across the 87 tumours for which 3,764 genes 
had both mRNA and protein measurements suitable for relative abund- 
ance comparison (Methods). Although 89% of the genes showed a posi- 
tive mRNA-protein correlation, only 32% had statistically significant 
correlations (Fig. 2b). The average Spearman’s correlation between 
mRNA and protein variation was 0.23, which was comparable to reported 
values for yeast, mouse and human cell lines’*"”’. 

To test whether the concordance between protein and mRNA vari- 
ation is related to the biological function of the gene product, we per- 
formed KEGG (Kyoto Encyclopedia of Genes and Genomes) enrichment 
analysis (Methods and Supplementary Table 5). Genes involved in several 
metabolic processes showed concordant mRNA and protein variation, 
whereas other gene classes showed low or even negative concordance in 
mRNA and protein variation (Fig. 2c). We also found that genes with 
stablemRNA and stable protein tend to have higher mRNA-protein cor- 
relation than those with unstable mRNA and unstable protein (P = 5.27 
X 10°, two-sided Wilcoxon rank-sum test, Methods, Extended Data 
Fig. 6b). Thus, mRNA measurements are poor predictors of protein abun- 
dance variations and both biological functions of the gene products and 
mRNA and protein stability may govern mRNA-protein correlation. 
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Figure 2 | Correlations between mRNA and protein abundance in TCGA 
tumours. a, Steady state mRNA and protein abundance were positively 
correlated in all 86 samples (multiple-test adjusted P < 0.01) with a mean 
Spearman’s correlation coefficient of 0.47. b, mRNA and protein variation were 
positively correlated for most (89.4%) mRNA-protein pairs across the 87 
samples, but only 32% showed significant correlation (multiple-test adjusted P 
< 0.01), with a mean Spearman’s correlation coefficient of 0.23. c, mRNA 
and protein levels displayed dramatically different correlation for genes 


Impact of copy number alterations 

The study by TCGA identified 17 regions of significant focal amplifi- 
cation and 28 regions of significant focal deletion. We examined the im- 
pact of CNAs on mRNA and protein abundance, including both cis-effects 
on the abundance of genes in the same loci and trans-effects on the 
abundance of genes at other loci in the genome (Methods). 

For all 23,125 genes with a CNA measurement in the TCGA data set, 
we calculated Spearman’s correlation with mRNA and protein abundance, 
respectively for the 3,764 genes with both mRNA and protein measure- 
ments (Methods). Examination of the matrix visualizing significant CNA- 
mRNA correlations (multiple-test adjusted P < 0.01) revealed strong 
positive correlations along the diagonal (Fig. 3a), suggesting strong cis- 
effects of CNAs on mRNA abundance. Most of the diagonal signals cor- 
responded to previously reported arm-level changes’. In contrast, the 
diagonal pattern was much weaker for CNA-protein correlations (Fig. 3b). 

To investigate further the cis-effects of CNAs, we separated all genes 
with CNA, mRNA and protein measurements into those in focal amp- 
lification regions, focal deletion regions and non-focal regions (that is, 
chromosomal regions without focal amplification or deletion). As shown 
in Extended Data Fig. 7, CNA-mRNA correlations were significantly 
higher than CNA-protein correlations for genes in all three groups 
(P<1.0X 102°, Kolmogorov-Smirnov test). Moreover, genes in the 
focal amplification regions showed significantly higher CNA-mRNA 
and CNA-protein correlations than genes in the non-focal regions 
(P=4.4 X 10 * and 0.02, respectively, Kolmogorov-Smirnov test). 
However, the same trend was not observed for genes in the focal dele- 
tion regions. Therefore, focal amplifications have the strongest cis-effects 
on both mRNA and protein abundance, suggesting that selection for 
high protein abundance may drive CNA in regions of focal amplifica- 
tion. However, many CNA-driven mRNA level increases do not trans- 
late into increased abundance of the corresponding proteins. 

Figure 3a, b also revealed multiple trans-acting CNA hot spots, defined 
as chromosomal loci whose alteration is significantly associated with 
abundance changes of many transcripts or proteins at other loci. Chro- 
mosomes 20q, 18, 16, 13 and 7 contained the five strongest hot spots 
driving global mRNA abundance variation. These hot spots also were 
strongest at the protein level. Most hot-spot-related transcript changes 
did not propagate to the protein level, presumably reflecting buffering 
of protein abundance by post-transcriptional regulation”. Notably, 
many hot-spot-associated protein-level alterations occurred in the 
absence of corresponding mRNA alterations, suggesting that the same 
trans-acting hot spot may exert independent effects at both the tran- 
scriptome and proteome levels. 
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involved in different biological processes. Genes encoding intermediary 
metabolism functions showed high mRNA-protein correlations, whereas 
genes involved in oxidative phosphorylation, RNA splicing and ribosome 
components showed low or negative correlations. Multiple-test adjusted two- 
sided P values from the Kolmogorov-Smirnov test were provided in the 
parentheses following the KEGG pathway names. Red and green in the figures 
indicate positive and negative correlations, respectively. 


The 20q amplification was associated with the largest global changes 
in both mRNA and protein levels in this univariate analysis. The same 
conclusion was reached with a regularized multivariate regression analy- 
sis method, remMap"* (Methods and Supplementary Tables 6-9). These 
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Figure 3 | Effects of copy number alterations on mRNA and protein 
abundance. a, b, The top panels show copy-number-abundance correlation 
matrices for mRNA abundance (a) and protein abundance (b) with significant 
positive and negative correlations (multiple-test adjusted P < 0.01, Spearman’s 
correlation coefficient) indicated by red and green, respectively, and genes 
ordered by chromosomal location on both x and y axes. The bottom panels 
show the frequency of mRNAs and proteins associated with a particular copy 
number alteration, where blue and black bars represent associations specific to 
mRNA and protein or common to both mRNA and protein, respectively. 
c-e, HNF4A (c), TOMM34 (d) and SRC (e) showed significant CNA-mRNA, 
mRNA-protein, and CNA-protein correlations (Spearman’s correlation 
coefficient). The colour grade from light yellow to red indicates relatively 
low-level (yellow) to high-level (red) of copy number, mRNA abundance, or 
relative protein abundance among the 85 samples, which were ordered by copy 
number data. 
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data highlight the importance of 20q amplification in CRC, which has 
not been well documented in previous studies. Among the 79 genes in 
the 20q region with quantifiable protein measurements, 67 (85%) showed 
significant CNA-mRNA correlation, but only 40 (51%) showed signifi- 
cant CNA-protein correlation (multiple-test adjusted P < 0.01, Spearman’s 
correlation coefficient, Supplementary Table 10). 

As significant CNA-protein correlations identify amplified sequences 
that translate to high protein abundance, proteomic measurements can 
help prioritize genes in amplified regions for further examination. Of 
particular interest among the 40 genes is HNF4A (Fig. 3c), a candidate 
driver gene nominated by TCGA for the 20q13.12 focal amplification 
peak®. HNF4c is a transcription factor with a key role in normal gastro- 
intestinal development” and is increasingly being linked to CRC”. 
However, there are contradictory reports on whether HNF4« acts as 
an oncogene or a tumour suppressor gene in CRC”. Upon reanalysis of 
the HNF4A short hairpin RNA (shRNA) knockdown data for CRC cell 
lines from the Achilles project”’, we found that the dependency of CRC 
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cells on HNF4« correlated significantly with the amplification level of 
HNF4A (Methods and Extended Data Fig. 8), which may partially 
explain the contradictory roles reported for HNF4« in CRC. Other 
interesting candidates included TOMM34 (Fig. 3d), which is overex- 
pressed frequently in CRC tumours and is involved in the growth of 
CRC cells”, and SRC (Fig. 3e), which encodes a non-receptor tyrosine 
kinase implicated in several human cancers including CRC”. 


Proteomic subtypes of CRC 


The TCGA study reported three transcriptomic subtypes of CRC, desig- 
nated ‘microsatellite instability/CpG island methylator phenotype’ (MSI/ 
CIMP), ‘invasive’, and ‘chromosomal instability’ (CIN). Given the lim- 
ited correlation between mRNA and protein levels, we asked whether 
CRC subtypes can be better represented with proteomics data. Using the 
consensus clustering** method (Methods and Extended Data Fig. 9), we 
identified five major proteomic subtypes in this tumour cohort, with 15, 
9, 25, 11 and 19 cases in subtypes A to E, respectively (Fig. 4a). 

We tested the association between the subtype classification and 
established genomic and epigenomic features of CRC using Fisher’s 
exact test (Fig. 4b and Supplementary Table 11). Almost all hypermutated 
and MSI-high tumours were included in subtypes B and C, as well as 
tumours with POLE and BRAF mutations. However, statistically significant 
association with these features was only observed for subtype B (multiple- 
test adjusted P < 0.05). Moreover, subtype B was significantly associated 
with the TCGA CIMP -high methylation subtype, whereas subtype C was 
significantly associated with a non-CIMP subtype (cluster 4). Another 
unique feature of subtype B was the lack of TP53 mutations and chro- 
mosome 18q loss. These results clearly established the association between 
proteomic subtype B and MSI-high and CIMP, but suggest that sub- 
type C may have different biological underpinnings. 

The remaining three subtypes were associated with CIN, another 
well-accepted genetic property of CRC. In particular, subtype E was 
significantly associated with both TP53 mutations and 18q loss, geno- 
mic features frequently associated with CIN tumours”. Interestingly, 
subtype E was also associated with HNF4A amplification and relatively 
higher abundance of HNF4z protein (Fig. 4c). HNF40 abundance was 
significantly higher in subtype E tumours compared to normal colon 
samples (multiple-test adjusted P = 1.09 X 10°, two-sided Wilcoxon 
rank-sum test); however, significant upregulation of HNF4« was not 
observed for other subtypes (Methods). This result, together with our 
reanalysis of siRNA knockdown data from the Achilles project (Extended 
Data Fig. 8), suggests that HNF4 dependency may be particularly asso- 
ciated with the subset of tumours or cells with HNF4A amplification. 


Figure 4 | Proteomic subtypes of colon and rectal cancers, associated 
genomic features, and relative abundance of HNF4a. a, Identification of five 
proteomic subtypes. Tumours are displayed as columns, grouped by proteomic 
subtypes as indicated by different colours. Proteins used for the subtype 
classification are displayed as rows. The heat map presents relative abundance 
of the proteins (logarithmic scale in base 2) in the 90-tumour cohort. 

b, Association of proteomic subtypes with major colorectal-cancer-associated 
genomic alterations and previously published transcriptomic and methylation 
subtypes. Subtypes that are significantly overlapped with a transcriptomic or 
methylation subtype are highlighted by pink boxes. Both proteomic subtypes B 
and C showed significant overlap with the TCGA MSI/CIMP subtype. In 
addition, they showed significant overlap with the CCS2 and CCS3 subtypes in 
the ref. 26 classification, respectively. Proteomic subtype B significantly 
overlapped with the TCGA CIMP-high methylation subtype, whereas subtype 
C significantly overlapped with a non-methylation subtype (TCGA cluster 4 
methylation subtype). Subtypes overrepresented with a specific genomic 
alteration are also highlighted by pink boxes. The green box highlights the 
absence of TP53 mutations and 18q loss in subtype B. c, The top panel shows 
HNF4A copy number and relative abundance of HNF4« protein in the five 
subtypes; the bottom panel compares relative abundance of HNF4c in the five 
subtypes to that in normal colon samples, respectively, and the adjusted P 
values are based on the two-sided Wilcoxon rank-sum test followed by 
multiple-test adjustment. 
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Wealso examined the association between the subtype classification 
and clinical features and found only that stage II tumours were signifi- 
cantly enriched in subtype C (multiple-test adjusted P < 0.05; Supplemen- 
tary Table 11). Supervised statistical analyses at the individual protein 
level for 13 clinical and genomic features also identified few, if any, sig- 
nificant protein effects of these features, except for hypermutation sta- 
tus, MSI status and 18q loss (Supplementary Table 12), suggesting that 
the proteomic subtypes identified by the unsupervised clustering ana- 
lysis captured the major proteome variations across the tumours. 

Next, we compared the proteomic subtype classification with the TCGA 
transcriptional subtype classification for the 62 samples that had both 
subtype labels. Proteomic subtypes B and C both showed significant 
association with the TCGA subtype MSI/CIMP (Fig. 4b and Supplemen- 
tary Table 11); however, they differ considerably at genomic, epigenomic 
and proteomic levels (Fig. 4a, b). We also examined alternative classifi- 
cations of the TCGA samples based on two recently published transcrip- 
tomic subtype classifiers**’’. Proteomic subtype C, but not subtype B, 
showed enriched overlap with the ‘stem-like’ subtype described in ref. 27 
and the colon cancer subtype 3 (CCS3) subtype described in ref. 26. Inter- 
estingly, tumours with stem-like and CCS3 classifications both have poor 
prognosis, which suggests that proteome subtype C also may be assoc- 
iated with poor prognosis. Therefore, the ability to distinguish subtype 
B from C through proteomics data are important, because MSI-high 
tumours typically have better prognosis”. 


Signatures for proteomic subtypes 


To better understand the biology underlying the proteomic subtypes, 
we identified protein signatures for each subtype by supervised com- 
parison of protein abundance in that subtype against all others; we also 
required signature proteins for a subtype to be significantly different in 
abundance compared to normal colon samples from 30 individuals ana- 
lysed on the same proteome analysis platform (Methods and Supplementary 
Tables 13 and 14). As shown in Extended Data Fig. 10a, all CRC sub- 
types displayed more than 2,000 (>60%) significant protein abund- 
ance differences compared to normal colon. Although a full validation 
of the proteomic subtypes and protein signatures for the subtypes will 
require proteomic profiling data from an independent tumour cohort, 
a low cross-validation error rate of 3.8% demonstrated good general- 
izability of the subtypes and their signature proteins (Methods). 

We performed Gene Ontology enrichment analysis for the subtype 
signatures using WebGestalt”* (Methods and Supplementary Table 15). 
Genes involved in ‘response to wounding’ were significantly enriched 
in the up-signature of subtype C (multiple-test adjusted P < 2.2 X 10° *°, 
Fisher’s exact test). The wound-response gene signature is a powerful 
predictor of poor clinical outcome in patients with early stage breast 
cancers”. This result further links our subtype C to poor prognosis. 

To understand better the functional networks underlying this sub- 
type with potential clinical importance, we uploaded the up and down 
signatures of subtype C to NetGestalt*® for enriched protein-protein 
interaction network module analysis. Four network modules were en- 
riched with genes in the up signature for subtype C, whereas two modules 
were enriched with genes in the down signature (multiple-test adjusted 
P< 0.01, Fisher’s exact test; Extended Data Fig. 10b). Notably, the down- 
signature-enriched module (III) included the E-cadherin (CDH1)-f- 
catenin (CTNNB1)-a-catenin (CTNNA1) complex (Extended Data 
Fig. 10c, e). E-cadherin, the most under-expressed protein in the sub- 
network, suppresses invasion in lobular breast carcinoma” and is a switch 
for the epithelial-to-mesenchymal transition (EMT), which is associated 
with poor prognosis in colon cancer*”. Other components of the mod- 
ule were desmosomal proteins (PKP2, JUP and DSG2) and cytokeratins 
(KRT18, KRT6A and KRT8). Reduction in both desmosome formation 
and cytokeratin expression is associated with EMT*’. Moreover, pro- 
teins in the most significantly upregulated network module (Extended 
Data Fig. 10d, f) included collagens (COL1A1 and COL3A1) and extra- 
cellular matrix glycoproteins (FN1, BGN, FBN1 and FBN2) that also 
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are markers of EMT**”». These data strengthen the association of sub- 
type C with poor prognosis and relate it to EMT activation. 


Discussion 


Our proteomic characterization of the genomically annotated TCGA 
colon tumours illustrates the power of integrated proteogenomic ana- 
lysis. The data demonstrate that protein abundance cannot be reliably 
predicted from DNA- or RNA-level measurements. mRNA and protein 
levels were modestly correlated, as earlier cell and animal model studies 
suggested”, but over two-thirds of these correlations were not statisti- 
cally significant in the TCGA tumour set. Although most CNAs in CRC 
drive mRNA abundance changes, relatively few translated to consistent 
changes in protein abundance. 

Genomic and proteomic technologies provide reinforcing data. RNA- 
seq data facilitate the discovery of variant proteins, which could serve as 
possible biomarker candidates or therapeutic targets. Combined mRNA 
and protein profiling data can identify potentially relevant genes in am- 
plified chromosomal regions. This approach, which revealed the impor- 
tance of chromosome 20q amplification and provided new insights into 
the role of HNF4« in CRC, can be broadly extended to understand roles 
of CNAs in other cancers. Proteomics identified CRC subtypes similar 
to those detectable by transcriptome profiles, but further captured fea- 
tures not detectable in transcript profiles. The separation of the TCGA 
MSI/CIMP subtype into distinct proteotypes illustrates the unique poten- 
tial of proteomics-based subtyping. After validation in independent cohorts, 
protein subtype signatures could be directly translated into laboratory 
tests for tumour classification. Integrated proteogenomic analysis, as 
demonstrated in this study, will enable new advances in cancer biology, 
diagnostics and therapeutics. 


METHODS SUMMARY 


All tumour samples for the current study were obtained through the TCGA Bio- 
specimen Core Resource (BCR) as described previously*. No other selection cri- 
teria other than availability were applied for this study. Patient-derived xenograft 
tumours from established basal and luminal B breast cancer intrinsic subtypes*”* 
were raised subcutaneously in 8-week-old NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ 
mice (Jackson Laboratories, Bar Harbour, Maine) as described previously”. Normal 
colon biopsies were obtained from screening colonoscopies performed between 
July 2006 and October 2010 under Vanderbilt University Institutional Review 
Board (IRB) approval no. 061096. 

Tissue proteins were extracted and tryptic peptide digests were analysed by 
multidimensional liquid chromatography-tandem mass spectrometry. Xenograft 
quality control samples were run after every five colorectal tumour samples. Raw 
data were processed for peptide identification by database and spectral library 
searching and identified peptides were assembled as proteins and mapped to gene 
identifiers for proteogenomic comparisons. Quantitative proteomic comparisons 
were based on spectral count data. Detailed descriptions of the samples, LC-MS/ 
MS analysis, and data analysis methods can be found in the Methods. All of the 
primary mass spectrometry data on TCGA tumour samples are deposited at the 
CPTAC Data Coordinating Center as raw and mzML files and complete protein 
assembly data sets for public access (https://cptac-data-portal.georgetown.edu). 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Molecular architecture and mechanism of 
the anaphase-promoting complex 


Leifu Chang'*+*, Ziguo Zhang*+*, Jing Yang’*+, Stephen H. McLaughlin? & David Barford!*+ 


The ubiquitination of cell cycle regulatory proteins by the anaphase-promoting complex/cyclosome (APC/C) controls sister 
chromatid segregation, cytokinesis and the establishment of the G1 phase of the cell cycle. The APC/C is an unusually large 
multimeric cullin-RING ligase. Its activity is strictly dependent on regulatory coactivator subunits that promote APC/C- 
substrate interactions and stimulate its catalytic reaction. Because the structures of many APC/C subunits and their orga- 
nization within the assembly are unknown, the molecular basis for these processes is poorly understood. Here, from a 
cryo-electron microscopy reconstruction of a human APC/C-coactivator-substrate complex at 7.4 A resolution, we have 
determined the complete secondary structural architecture of the complex. With this information we identified protein 
folds for structurally uncharacterized subunits, and the definitive location of all 20 APC/C subunits within the 1.2 MDa 
assembly. Comparison with apo APC/C shows that the coactivator promotes a profound allosteric transition involving 
displacement of the cullin-RING catalytic subunits relative to the degron-recognition module of coactivator and APC10. 
This transition is accompanied by increased flexibility of the cullin-RING subunits and enhanced affinity for UBCH10- 


ubiquitin, changes which may contribute to coactivator-mediated stimulation of APC/C E3 ligase activity. 


Regulation of cell division by reversible protein phosphorylation and 
ubiquitination involves the coordinated interplay of protein kinases and 
phosphatases, and ubiquitin ligases and deubiquitinases'. The anaphase- 
promoting complex/cyclosome (APC/C) is an E3 cullin-RING ligase that 
mediates ubiquitin-dependent proteolysis of specific regulatory proteins 
to control chromosome segregation in mitosis, the events of cytokinesis 
and mitotic exit, maintenance of G1, and the initiation of DNA replication”’. 
Through its cullin-RING catalytic module, the APC/C is related to the 
Skp1-Cullin-F-box protein (SCF) E3 ligases responsible for controlling 
S and G2 progression. However, compared to other RING ligases, the 
APC/Cis unusually large. The reasons for its complexity likely arise from 
its role in orchestrating multiple cell cycle processes, its regulation by pro- 
tein phosphorylation, and as the ultimate effector of the spindle assem- 
bly checkpoint’. 

The activity of the APC/C is tightly controlled by structurally related 
coactivator subunits (either CDC20 or CDH 1). Their interaction with 
the APC/C is reciprocally regulated by CDK-dependent phosphoryla- 
tion of core APC/C and coactivator subunits, resulting in a switch of 
CDC20 and CDH1 in late mitosis and change of substrate specificity. 
Coactivators recruit substrates to the APC/C by recognizing conserved 
destruction motifs or degrons—the D box° and KEN box®. In addition 
to their substrate recruitment roles, coactivators stimulate the E3 ligase 
activity of the APC/C through an unknown mechanism’. 

The activated APC/C-coactivator complex is an assembly of 15 dif- 
ferent proteins in vertebrates (~ 1.2 MDa), of which only four have roles 
in catalysis (APC2 and APC11) and substrate recognition (APC10 and 
coactivator) (Extended Data Table 1a). Other large APC/C subunits func- 
tion as molecular scaffolds to coordinate the juxtaposition of the catalytic 
and degron-recognition modules, although more direct roles in substrate 
and/or E2-ubiquitin interactions are also possible. Remarkably, all scaf- 
folding subunits incorporate multiple repeat motifs. These include five 
TPR (tetratricopeptide repeat) proteins, the PC (proteasome/cyclosome) 
domain of APC1, and putative WD40 domain in APC4. 


Reconstitution of recombinant APC/C facilitated its structural and 
biochemical analysis, providing definition of subunit stoichiometry and 
the location of many APC/C subunits within the electron microscopy- 
derived molecular envelope*"°. However, the structures and locations 
of numerous APC/C subunits remained unknown, and there were un- 
certainties regarding the location of its catalytic centre. To address these 
questions we determined a sub-nanometre resolution electron microscopy 
reconstruction of human APC/Cas a ternary complex with the coactivator 
CDH1 and a high affinity substrate HSL1 (refs 11, 12) (APC/C-CDH1- 
HSL1). From this reconstruction we derived the complete secondary struc- 
ture of the APC/C, allowing us to define the architectures and correctly 
assign positions of all APC/C subunits. In addition, we found that coac- 
tivator enhances APC/C affinity for UBCH10-ubiquitin, explaining at 
least in part how the coactivator stimulates APC/C ubiquitination activity. 
Comparison of ternary and apo APC/C structures defines the basis of 
the coactivator-mediated allosteric transition and provides insights into 
the mechanisms underlying this increased affinity for UBCH10-ubiquitin. 


APC/C architecture 


We determined a three dimensional reconstruction of the human APC/ 
C-CDH1-HSL1 ternary complex to a nominal resolution of 7.4 A (Fig. 1, 
Extended Data Figs 1 and 2 and Extended Data Table 1b). The resultant 
map is of sufficient quality to visualize o-helices as discrete rods and 
B-sheets as planar densities. Figure 1 shows three views of the APC/C with 
its molecular envelope colour-coded according to subunit assignments, 
together with the underlying secondary structure elements and archi- 
tecture of individual APC/C subunits (Supplementary Video 1). Our 
higher resolution map indicates revised positions for APC1 and APC2. 
These subunits were previously assigned by means of antibody labelling’*"* 
which guided docking of APC2 into a lower resolution electron micro- 
scopy map of budding yeast APC/C*. 

The APC/C adopts a triangular shape delineated by a lattice-like shell*'*” 
(Fig. 1). The back and top of the complex is formed from the bowl-shaped 


1Division of Structural Biology, Institute of Cancer Research, 237 Fulham Road, London SW3 6JB, UK. @MRC Laboratory of Molecular Biology, Cambridge CB2 OQH, UK. Present address: MRC Laboratory of 


Molecular Biology, Cambridge CB2 OQH, UK (L-C., Z.Z., J.Y. and D.B.). 
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Figure 1 | Electron microscopy reconstructions of the H. sapiens APC/C- 
CDH1-HSLI ternary complex at 7.4 resolution. a-—c, Three views of the 
molecular envelope with the electron microscopy map segmented and 


TPR lobe—an assembly of the four canonical TPR proteins and their 
accessory subunits (Extended Data Table 1a). The base of the APC/C 
comprises the platform subunits of APC4 and APCS, together with two 
domains of APC1. The carboxy-terminal PC domain of APC1 (APC1"°) 
extends from the platform to contact the TPR lobe (Fig. 1b). The degron- 
recognition module of APC10 and CDH1 is located at the top of the cavity 
with APC10 interacting extensively with APC1°°. The catalytic subunits 
APC2 (cullin) and APC11 (RING) of APC/C are positioned at the periph- 
ery of the platform such that the C-terminal domain of APC2 (APC2“') 
and associated APC11 are at the front of the cavity below APC10 and 
CDH (Fig. 1a). 


The TPR lobe is a quasi-symmetric array 


The four evolutionarily related TPR proteins of the TPR lobe self-associate 
to form similar V-shaped homodimers (Fig. 2a-d). Each subunit com- 
prises an o-helical solenoid with two turns of TPR helix'**°. With the 
amino-terminal TPR helix forming the homodimer interface, the C- 
terminal TPR helix creates a protein-binding groove. APC6 binds its 
accessory subunit APC12 through this groove (Fig. 2a)'*"’, whereas the 
APC3 and APC8 homodimers utilize one of their dyad-related C-terminal 
grooves to engage CDH1 (described below). APC10 contacts the TPR 
helical groove of the opposite subunit of the APC3 homodimer. Rela- 
tive to the full-length Schizosaccharomyces pombe crystal structure’, 
APC6 differs slightly in the context of the APC/C (Extended Data Fig. 3a 
and b and Supplementary Video 1). 

TPR homodimers stack in parallel to generate a left-handed supra- 
helix featuring quasi-twofold symmetry (Figs 1, 2e and Supplementary 


APC12B 


Figure 2 | The TPR homodimers assemble to form a quasi-symmetric array. 
a-d, The four canonical TPR homodimers form structurally related “V-shaped 
homodimers. Shown are the segmented electron microscopy maps and 
tertiary structures of APC6 (a), APC3 (b), APC8 (c) and APC7 (d). The two 
subunits of the homodimer are designated ‘A’ and ‘B’ and for APC6 are shown 
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colour-coded according to subunit assignments. The molecular envelope is 
represented as a transparent surface to view the underlying secondary 
structural elements and subunit architectures. 


Video 1)*°. An ordered assembly of the TPR lobe is partly determined 
by the TPR accessory subunits APC13 and APC16, whose extended 
conformations span multiple TPR subunits (Fig. 2e). APC16 was iden- 
tified as a long rod-like density and modelled as a 40-residue o-helix. It 
lies along the shallow grooves created by «-helices of TPR subunits 
APC3 and APC7 (interfaces i to iii) (Fig. 1 and Extended Data Fig. 3c). 
The location of APC16 agrees with its mapping to the upper region of 
the TPR lobe”, and its co-purification with an APC3-APC7 sub-complex 
(data not shown). APC13 interacts with APC3, APC6 and APC (inter- 
faces iv to vii) (Fig. 1 and Fig. 2e and Extended Data Fig. 3c), consistent 
with electron microscopy analysis of an APC13-green fluorescent pro- 
tein fusion that positioned APC13 close to the APC6-APC3 interface’®. 


Architectures of APC1, APC4 and APC5 


APC4 and APC5 were assigned to the platform through antibody labelling", 
a negative stain electron microscopy analysis of an APC4-APC5 sub- 
complex®, and assignment of toroidal density to the putative WD40 do- 
main of APC4 (ref. 14) (Supplementary Video 1). Analysis of the platform 
in the APC/C-CDH1-HSL1 map reveals that the toroidal density com- 
prises a WD40 domain (APC4™”"*) attached to a long helical bundle 
domain (APC4™®?) (Fig. 3a and Extended Data Fig. 3d). This proposed 
architecture of APC4 is consistent with two-dimensional class views of 
an APC4-mFab complex derived using negatively stained electron micro- 
graphs (Extended Data Fig. 4), and with secondary structure predictions. 
Lying alongside APC4#®? is a TPR helix of 13 TPR motifs (Figs 1c and 3b) 
that corresponds to the TPR domain predicted for APC5 (APC5!PR) 
(ref, 8). Connected to APC5"?®, and at one end of APC4"™”, isa globular 


in two shades of magenta. C-terminal TPR motifs 13 and 14 of APC7B and 
APC8A project into solvent and are partially disordered. e, Front view of the 
TPR lobe showing how the TPR accessory subunits APC13 and APC16 span 
multiple TPR subunits. Roman numerals refer to interfaces indicated in 
Extended Data Fig. 3c. 
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Figure 3 | Architecture of platform subunits. a, Overall view of APC4; 

b, APC5 with APC15; c, APC1. d, Deletion analysis reveals APC15 density in 
the vicinity of APCS: (left) apo APC/C, (right) apo APC/C“*?°!°, e, APC15- 
assigned density contacts APC8 and APCS. f, Only APC2N7” (cullin repeats) is 
visible in the sharpened electron microscopy map, whereas the flexibility of the 
APC2°T?-APC11 module results in weak fragmented density. g, An 
unsharpened map from three dimensional classification indicates the position 
of the APC2“'?-APC11 module. 


domain of nine a-helices (Figs 1c and 3b) that we assigned to the pre- 
dicted N-terminal o-helical domain of APC5 (APC5N™”), 

APC1, unambiguously identified from its PC (proteasome-cyclosome) 
domain (APC1”, is located adjacent to APC2, APC5 and APCS (Fig. 1a, b). 
APC1" resembles the PC domain of the proteasomal subunit Rpn2 (ref. 22) 
(Figs la, b, 3c and Extended Data Fig. 3e). Its eleven PC repeats assemble 
into a closed toroidal structure of two concentric rings of a-helices encir- 
cling two axial o-helices. Electron microscopy density corresponding 
to these 24 co-linear o-helices is resolved adjacent to APC10. The rod- 
shaped density immediately beneath APC1'©, connecting it to the plat- 
form, resembles an o-helical solenoid (Figs 1b and 3c), and corresponds 
to the predicted ~300-residue mid-helical domain of APC1 (APCIM™?), 
At the interface of the mid-helical and PC domains, a small B-sandwich 
structure (Fig. 3c) was assigned to the predicted B-sheet that immedi- 
ately follows APC1”S. 

A third toroidal-shaped density, located adjacent to APC1M™, was 
assigned as the N-terminal WD40 domain of APC1 (APC pw) (Fig. 3c 
and Extended Data Fig. 3f), consistent with the B-strand prediction for 
the N terminus of APC1. APC1“*° tucks into the helical groove of 
APC57PR (Fig. 1c), and forms an edge-on contact with the C-terminal 
TPR helix of APC8 (Fig. 1b). 

APC15 was identified from difference density from a reconstitution 
of the APC/C with APC15 deleted (APC/C*4?“"*) (Figs 1c, 3d and Ex- 
tended Data Figs 1, 2). This revealed APC15 as the helical density bridg- 
ing APC8 and APCS (Fig. 3e), connected to an extended structure that 
inserts into the inner groove of the APC5'?® superhelix (Fig. 3d, e). An 
interaction of APC15 with APC8 and platform subunits is in agree- 
ment with biochemical and structural evidence**”’. 


The catalytic module is flexible 


For APC2, only a-helices of the N-terminal cullin repeats are clearly 
identified in the electron microscopy map (Fig. 1a and Fig. 3f). Density 
for APC2°' and APC11 is weak and fragmented. In an unsharpened 
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map calculated to reveal low-resolution features, the APC2°T?_APC11 
module is recovered as a horn-like density (Fig. 3g), similar to a feature 
described in the 25 A resolution cryo-electron microscopy map of apo 
S. pombe APC/C"*. Three dimensional classification of the APC/C- 
CDH1-HSLI electron microscopy data indicated structural variability 
of this density (Extended Data Fig. 5), consistent with the notion that 
the APC2°T?-APC11 module is flexible. We used a well-defined three 
dimensional class to dock APC2~7?-APC11, although in this density 
the APC11 RING domain cannot be located precisely (Fig. 3g). Because 
of the flexibility of APC2°T?, we cannot exclude the possibility that 
APC11"'NS js displaced from APC2°"”, similar to the Rbxl RING 
domain of the activated SCF**”. 


Degron recognition by CDH1 and APC10 


CDH1 and APC10are in close proximity within the inner cavity adjacent 
to the APC3 homodimer'*” (Figs la, 4a). We fitted the B-sheets of the 
WD40 £-propeller domain of CDH1 (ref. 26) and the B-sandwich of 
APC10 (refs 27, 28) into their respective densities (Fig. 4a and Extended 
Data Fig. 3h, i). APC10 and CDH1 share structurally related C-terminal 
Tle-Arg motifs (IR tails) that interact with the C-terminal TPR motifs of 
APC3 (refs 12, 27, 29-31). Density for the IR tails of CDH1 and APC10 
is visible extending from their globular domains (Fig. 4a). This shows 
that the two IR tails bind to two equivalent symmetry-related sites in the 
TPR helices of the APC3 homodimer (Fig. 4b). The three TPR motifs of 
the IR tail-binding sites were shown previously to be required for IR 
tail-dependent interactions of CDH1 to the APC/C”. 

Electron microscopy density for the HSL1 substrate is visible at the 
KEN box-recognition site centred on the upper surface of CDH1“?*° 
(ref. 26), whereas D box-assigned density is observed at the D box- 
coreceptor formed by CDH1 and APC10 (refs 15, 26, 30, 32-34) (Fig. 4a, d). 
Other segments of HSL] are unstructured and not defined in the elec- 
tron microscopy map. Crystallographic studies indicated that the D box- 
recognition site on coactivator is a channel on the edge of the WD40 
domain that engages the D box motif RxxLxx(V/I/L) (residues P1 to 
P7), but not conserved polar residues P8 to P10 (ref. 26). In the APC/ 
C-CDH1-HSLI electron microscopy map, we observe tubular density 
that extends from the D box at the coactivator D box-binding site and 
connects with APC10. Modelling this density as P8 to P10 shows that 
these residues interact with a conserved polar surface on APC10 (Fig. 4d). 
Disruption of a loop that comprises this site (the 140s loop) impairs D 
box-dependent substrate recognition’’. Ala substitutions of two hydro- 
philic residues (Ser 88 and Asn 147) in close proximity to P8 to P10 of the 
D box (Fig. 4d) attenuates APC/C°""" ubiquitination activity, as does 
mutation of D box residues P8 to P10 (Extended Data Fig. 6). Thus, the D 
box isa bipartite degron comprising a coactivator-interacting N-terminal 
(RxxLxx(V/I/L)) motif, and a hydrophilic C-terminal APC10-binding 
segment (Fig. 4e). 


CDH1 promotes an allosteric transition 


We could also visualize components of the N-terminal segment of CDH1 
(CDH1N7°). This forms a four o-helix bundle that packs co-linearly with 
the o-helices of APC1”© and also contacts APC6 (Fig. 4f, gand Extended 
Data Fig. 3g). In addition, an extended segment of CDH1“'” (L1) binds 
to the inner groove of the APC8B TPR helix, interacting with three TPR 
motifs that are structurally related to the IR tail-binding site of APC3 
(Fig. 4c), and which contribute to coactivator binding’*”». 

The N terminus of coactivator stimulates APC/C ubiquitin ligase ac- 
tivity, independent of degron recognition’. To understand coactivator- 
mediated APC/C activation in the context of our complete subunit 
assignment, we determined three dimensional reconstructions of apo 
APC/C (Extended Data Figs 1 and 2). The most prominent structural 
difference between apo and ternary complexes involves a displacement 
of the platform subunits (Fig. 5a, b), similar to those described at lower 
resolution for Xenopus and human APC/C'*"*. Moreover, in the apo 
state, electron microscopy density for APC2“'” is more clearly defined 
(Fig. 5a, c) which, coupled to the lack of structural variability of APC2CTP 
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Figure 4 | Mechanism of D box recognition by CDH1 and APC10 and 
interactions with APC/C subunits. a, View showing IR tails of CDH1 and 
APC10 interacting with equivalent, symmetry-related sites on the APC3 
homodimer. b, The IR tails of CDH1 and APC10 interact with equivalent TPR 
motifs 7 to 9 of symmetry-related subunits of the APC3 homodimer. 

c, CDH1N"” interacts with APC8B, including TPR motifs equivalent to the IR 
tail-binding sites of APC3. d, Electron microscopy density (mesh) at the KEN 


in three dimensional classes, indicates reduced structural flexibility of 
APC2°P (Extended Data Fig. 7). 

Superimposing atomic models of apo and ternary states using APC1 me 
and morphing between the two structures, defines the conformational 
transition induced on coactivator binding (Fig. 5b, d and Supplementary 
Video 2). This involves a rigid-body rotation of the platform and C-terminal 
TPR helix of APC8B around a hinge axis that bisects the APC1°°-APC8B 
interface, propagating a conformational change of 20 A to the peripheral 
APC2°'?-APC11 catalytic module (Fig. 5 and Supplementary Video 2). 
Driving this rotation is the interaction of CDH1“"? with APC1°° and 
APC8B. In the apo state, the C-terminal TPR helix of APC8B is well 
ordered and interacts with APC1?© (Fig. 4f). In the ternary complex, 
CDHI1N"” disrupts this interaction by binding toa site on APC1°“ that 
overlaps the site in contact with APC8B. This wedges APC1 PC and APC8B 
apart and disorders TPR motifs 13 and 14 of APC8B (Fig. 4f). A concom- 
itant downwards-displacement of APC8B tilts the platform to translate 
the APC2°TP-APC11 module upwards (Fig. 5d, e). 

Coactivator-induced displacement and flexibility of the APC. 
APC11 module has implications for understanding how the coactivator 
NTD stimulates APC/C ubiquitin ligase activity’. Increased catalytic 
activity could result from the flexibility of the APC2“'?-APC11 mod- 
ule which, combined with its different location relative to the substrate- 
recognition module, may provide enhanced substrate (and ubiquitin 
polymer) access to the APC/C-bound E2-ubiquitin conjugate. Other 
possibilities include (1) enhancing the affinity of the APC/C for the E2- 
ubiquitin conjugate, (2) promoting the closed more reactive configura- 
tion of the E2-donor ubiquitin conjugate to stimulate efficient ubiquitin 
attachment to the target lysine**”, or (3) inducing an allosteric trans- 
ition of the E2 (ref. 41). 

Toassess the effects of CDH1 on the affinity of APC/C for its cognate 
E2s we used surface plasmon resonance (SPR). Ternary APC/C bound 
UBCH10 with a Kp of 0.2 11M, an affinity some threefold higher than 
apo APC/C (Extended Data Fig. 9). Interestingly, coactivator promoted 
a more pronounced sevenfold increase in APC/C’s affinity for UBCH10- 
Ub because ubiquitin conjugation of UBCH10 reduced its affinity for apo 


CTD 
2 = 


iS 
APC/C-CDH1-HSL1 
ternary complex 


Apo (APC114R ING) 


box and D box sites of CDH1. Electron microscopy density for the D box 
interacting with two conserved loops of APC10 is visible. e, D box interacts as a 
bipartite degron with the D box coreceptor of CDH1 and APC10. Shown is a D 
box consensus sequence”. f, Interaction of CDH1N'” with the APCI?° 
displaces APC1°°-APCS interactions. g, CDH1N7” interactions with APC6 
(circled in right panel). 


APC/C nearly threefold, whereas binding to ternary APC/C was only 
slightly affected (Extended Data Fig. 9). In contrast to UBCH 10, ternary 
and apo APC/C had similar affinities for UBE2S (Kp 0.2 and 0.3 uM) 
(Extended Data Fig. 9), showing that coactivator-mediated APC/C acti- 
vation does not enhance UBE2S affinity, and consistent with the idea 
that UBCH10 and UBE2S bind to distinct sites on the complex”. The 
C-terminal extension of UBE2S is required for synthesis of poly-ubiquitin 
chains on APC/C substrates”. Deleting this C terminus to yield the 
UBC domain alone (UBE2S-AC) abolished UBE2S binding to the core 
APC/C™, a finding we confirmed for both apo and holo APC/C using 
SPR (Extended Data Fig. 9). The APC2°TP_APC11 catalytic module had 
a low affinity for UBCH10 and did not interact with either UBCH10-Ub 
or UBE2S (Extended Data Fig. 9), indicating that high-affinity APC/C 
interactions with both UBCH10 and UBE2S depend on subunits other 
than APC2°'P-APC11. 

That ubiquitin modification of UBCH10 interferes with its binding 
to apo APC/C suggests that the energetically-favoured conformations 
of the UBCH10-Ub conjugate, including the catalytically-efficient closed 
conformation**™, are sterically hindered from binding apo APC/C. Thus, 
in addition to an increased affinity for UBCH10-Ub, a coactivator- 
dependent conformational change that allows association of UBCH10- 
Ub in the closed conformation, with enhanced intrinsic catalytic activity, 
could also contribute to stimulation of the E3 ligase activity of APC/C. It 
is therefore likely that, reminiscent of how Nedd8 conjugation stimu- 
lates the SCF“, multiple mechanisms underlie coactivator-dependent 
stimulation of the APC/C, and potentially only UBCH10—responsible 
for initiating substrate ubiquitination*?**°—is subject to coactivator- 
mediated regulation. 

In conclusion, comparison of our APC/C-CDH1-HSL] ternary struc- 
ture with the electron microscopy reconstructions of APC/ CMCS (ref. 14) 
and APC/C-CDH1-EMI] (ref. 47), determined at lower resolutions, 
shows that the presence of coactivator in all three complexes is assoc- 
iated with the same activated conformation of the platform subunits 
(Extended Data Fig. 8a, b). From this analysis it is clear that the MCC 
makes direct contact with the APC2©7?-APC11 module, as does the 
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Figure 5 | Conformational change of the APC/C induced by CDH1‘7”. 

a, Superimposition of ternary and apo APC/C maps. The ternary map is colour- 
coded according to the subunits assignments of Fig. 1. Apo map is in grey. 

b, Superimposition of the apo and ternary coordinates. Apo coordinates are in 
grey except APC2, APC11 and APC4 shown in a lighter shade relative to the 
ternary complex. c, Two views showing the displacement of APC2-APC11 and 
Apc4?"°. d, View of rotation of the platform subunits on transition from apo 
to ternary states. Subunits of apo APC/C are shown in a lighter shade relative to 
the ternary complex. e, Schematic of the platform displacement. Insertion of 
CDH1N"” (depicted asa wedge) | between APC1?° and APC8B causes platform 
rotation about the fixed APC1°° domain. This propagates a 20- A shift to 
APC2°TP-APC11. The rotation axis is shown as a black circle. 


density assigned to the inhibitory zinc-binding region (ZBR) and poly- 
basic tail of EMI1 (ref. 47), suggesting that these inhibitors may inter- 
fere with E2-Ub binding to APC2°'”-APC11 and/or its flexibility. 

A striking feature of the APC/C is the abundance of multiple repeat 
motif proteins (Extended Data Table 1a). The symmetry of TPR homo- 
dimers, coupled to the evolution of paralogues, may have resulted in 
the acquisition of replicated functions. For example, the IR tails of co- 
activator and APC10, and NTD of coactivator bind to equivalent sites 
on three TPR subunits (Fig. 4b, c). Our model of the APC/C provides 
insights into the assembly of other large multimeric complexes. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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A massive galaxy in its core formation phase three 
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Most massive galaxies are thought to have formed their dense stellar 
cores in early cosmic epochs’ *. Previous studies have found galaxies 
with high gas velocity dispersions* or small apparent sizes’, but so 
far no objects have been identified with both the stellar structure and 
the gas dynamics of a forming core. Here we report a candidate core 
in the process of formation 11 billion years ago, at redshift z = 2.3. 
This galaxy, GOODS-N-774, has a stellar mass of 100 billion solar 
masses, a half-light radius of 1.0 kiloparsecs and a star formation rate 
of 90* 50 solar masses per year. The star-forming gas has a velocity 
dispersion of 317 + 30 kilometres per second. This is similar to the 
stellar velocity dispersions of the putative descendants of GOODS- 
N-774, which are compact quiescent galaxies at z~ 2 (refs 8-11) 
and giant elliptical galaxies in the nearby Universe. Galaxies such as 
GOODS-N-774 seem to be rare; however, from the star formation 
rate and size of this galaxy we infer that many star-forming cores may 
be heavily obscured, and could be missed in optical and near-infrared 
surveys. 

We identified the candidate forming core, GOODS-N-774, using the 
3D-HST catalogues in the five CANDELS (Cosmic Assembly Near- 
Infrared Deep Extragalactic Legacy Survey) fields'*. GOODS-N-774 has 
a circularized effective radius of r. = 1.0 kpc from Hubble Space Tele- 
scope (HST) F160W (Hy¢0) Wide Field Camera 3 (WFC3) imaging;”* a 
stellar mass of 1.0 x 10'°Mo (refs 12, 14; Mo, solar mass); rest-frame 
UV] colours consistent with a star-forming galaxy; and a Spitzer Multi- 
band Imaging Photometer (MIPS) 24 jum flux of 104 Jy. Figure 1 shows 
the stellar mass density profile derived from the observed Hy 69 surface 
brightness profile corrected for the HST point spread function’®. The 
surface density profile is strikingly similar to the average profile of mas- 
sive quiescent galaxies at z ~ 2 (red line), and much more concentrated 
than the average profile of massive star-forming galaxies at that redshift'* 
(light blue). 

The near-infrared spectrum of GOODS-N-774 is shown in Fig. 2. The 
continuum is clearly detected, along with emission lines that we identify 
as Ho and [N 1] redshifted to z = 2.300. The gas velocity dispersion is 
o =317+30kms_ |, equivalent to a full-width at half-maximum of 
750kms *. Typically, objects with such large linewidths are mergers 
or are dominated by active galactic nuclei* (AGNs). If the line emission 
in GOODS-N-774 is partly or largely due to the presence of an AGN, 
its velocity dispersion, size and stellar mass measurements would not 
be reliable. 

There is no evidence for the presence of an AGN in GOODS-N-774. 
It is not detected in the deep Chandra 2 Ms X-ray data in GOODS- 
North with an X-ray luminosity upper limit of Lx < 1.2 x 10” ergs '. 
Although an AGN cannot be conclusively ruled out, this upper limit is 
consistent with the star formation rate of the galaxy. Also, the galaxy 
has line ratios [O m]/[O 11] = 0.7 + 0.5, [O m]/HB = 1.2 + 0.9and [N m]/ 
Ha = 0.4 + 0.1, indicating that the gas is in a low-ionization state. 


Therefore, stellar photoionization and, hence, ultimately, star formation, 
is the likely origin of the line emission. Finally, the observed infrared 
spectral energy distribution (SED) requires strong polycyclic aromatic 
hydrocarbon emission to simultaneously explain the MIPS 24 tm and 
Herschel data (Fig. 3), effectively ruling out the presence of a dominant 
AGN. We quantified this by fitting composite SEDs with varying AGN 
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Figure 1 | Structural properties of GOODS-N-774. a, Surface density profile 
of GOODS-N-774 (blue line), as derived from deep WFC3 Hj ¢o imaging. Error 
bars are s.d. The galaxy has a mass of 1.0 x 10''Mo and an effective radius 
ofr. = 1.0 kpc. The light blue curve shows the average profile of 67 star-forming 
galaxies at 1.9<z<2.1 with 10.9 <log(Mgtenar) < 11.2 (refs 12, 13). The red 
curve shows the average profile of 24 quiescent galaxies with the same mass and 
redshift selection criteria. bp-d, Colour images show GOODS-N-774 (b), a 
typical quiescent galaxy (c) and a typical star-forming galaxy (d). Vertical 
bars along the x axis in a indicate effective radii for the galaxies in b-d 
(colour-coded). The structure of GOODS-N-774 is similar to that of 

massive quiescent galaxies. 
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Figure 2 | Velocity dispersion of GOODS-N-774. Near-infrared spectrum in 
two dimensions (a) and 1D (b) obtained with NIRSPEC on KeckI. The grey 
curve is at the original resolution; the black curve shows the spectrum smoothed 
with a 20 A boxcar. The best-fit Gaussians to the Ha 2 = 6,563 A and [Nu] 

2 = 6,548 and 6,584 A emission lines are shown in red. The velocity dispersion 
is 317 + 30kms |, which is equivalent to an inclination-corrected circular 
velocity of V,ot~ 680kms° * if the gas is rotating in a disk. The rest- 

frame equivalent width of Ha is 66 + 8 A and its luminosity is 
(3.4+0.4) x 10ergs 7. 


contributions’’. The best fit is obtained for a pure star-forming template 
with no AGN contribution (Fig. 3). 

We infer that the linewidth of GOODS-N-774 is among the highest 
measured for a normal star-forming galaxy at high redshift (Extended 
Data Fig. 1). If the gas is in a disk, it is rotating with a velocity of Vio. ~ 
550kms 1, or Vio ~ 680 kms’ after correcting for inclination. The 
observed gas velocity dispersion of 317 kms ' is similar to the median 
stellar velocity dispersion of 304kms_' in a sample of quiescent gal- 
axies at z = 1.5-2.2 with median mass 1.9 x 10!°Mo (refs 8-11; Fig. 4). 

The inferred dynamical mass is 1.5 x 10''Mg, which is roughly 1.5 
times the stellar mass, suggesting a gas fraction of < 50%. In Fig. 4, we 
explicitly compare the dynamical and structural properties of GOODS- 
N-774 with those of galaxies in the Sloan Digital Sky Survey (SDSS) and 
those of quiescent galaxies at z ~ 2. The galaxy has a much smaller size 
and a higher velocity dispersion than do SDSS galaxies of the same total 
dynamical mass. Its properties are very similar to those of the samples 
of quiescent galaxies at z ~ 2 that have been compiled over the past few 
years, and we infer that we have identified an example of star-forming 
galaxies in this region of parameter space. 

The Ha luminosity is (3.4 + 0.4) X 10** ergs” ', which implies a min- 
imum star formation rate (with no reddening correction) of ~16Mo yr ' 
for a Chabrier initial mass function’”"*. The red colour of the galaxy 
(Roos — Hieo = 4.2 mag), and the fact that it is detected with MIPS and 
Herschel, suggests that the actual, dust-corrected star formation rate is 
much higher. The 24 jum flux alone indicates a star formation rate of 
135Mo yr’ (ref. 19). Fitting the 24 tm-500 pm data with empirical 
composite star-forming SEDs’* or theoretical models” gives slightly 
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Figure 3 | Ultraviolet/far-infrared spectral energy distribution of GOODS- 
N-774. Rest-frame ultraviolet/far-infrared photometry of GOODS-N-774. 
Error bars are s.d. A stellar population synthesis model fit'* to the ultraviolet/ 
near-infrared SED is shown in grey. The black line shows the composite 
star-formation/AGN SED” that is the best fit to the mid- and far-infrared data. 
This best fit has no AGN contribution and implies a star formation rate of 
90Mo yr~'. For reference, the light blue line shows a composite SED with an 
AGN contribution of 80%. The red curve shows a black body with a size of 1 kpc 
and a bolometric luminosity of 10'? solar luminosities. 


lower values than do the 24 um data alone, and we infer that the star 
formation rate is 90753 Mo yr‘. This confirms that the star formation 
is highly obscured, with ~3 mag of extinction in the direction of the Ha 
emission and an infrared/ultraviolet luminosity ratio of = 200. 

GOODS-N-774 has a specific star formation rate of ~1 X 10° yr i 
which places it on the star-forming sequence at z = 2.3 (ref. 19). If the 
galaxy had a constant star formation rate leading up to the epoch of 
observation, then its mass was built up over a period of ~1 Gyr since 
z = 3.3. Although short compared with the age of the Universe at z = 
2.3, this build-up phase is ~200 dynamical times, which is longer than 
expected from the Kennicutt-Schmidt law”’. This suggests that the gal- 
axy had a higher star formation rate in the past, or that the star forma- 
tion rate has been throttled by the rate of gas accretion onto the halo: a 
galaxy with a stellar mass of M,, = 1.0 x 10!!Mo would havea baryonic 
accretion rate of ~(60—120)Mo yr7! (ref. 22), in good agreement 
with the observed star formation rate. 

The gas in a galaxy such as this, growing by means of rapid star for- 
mation in a deep gravitational potential well, should be rapidly enriched 
with metals, and we would thus expect it to exhibit a high gas-phase 
metallicity. This is consistent with what we observe: the galaxy has [N 11]/ 
Ha = 0.4 + 0.1, which implies a high metallicity (12 + log(O/H)~9.05, 
although the conversion” is somewhat uncertain). After the star for- 
mation phase, the gas is probably heated, expelled or both”. The qui- 
escent core that remains will then probably evolve into a giant elliptical 
galaxy~* with a central stellar metallicity that is similar to the gas-phase 
metallicity of the star-forming core at high redshift”. 

Galaxies such as GOODS-N-774 are rare. Candidate compact star- 
forming galaxies with less extreme properties have been identified in 
fairly large numbers*®, but in the ~900 arcmin’ of the five 3D-HST 
CANDELS fields there are only three objects at 2<_z<3 with 24m 
fluxes of =100 pJy, high central mass densities (log(M/Mo), <1 kpc = 
10.5; ref. 7) and concentrated stellar distributions (r. <= 1 kpc). We 
observed all three galaxies with Keck I, and GOODS-N-774 is the only 
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Figure 4 | Properties of GOODS-N-774 


: compared with quiescent galaxies. Comparisons 
of the size, mass and gas dynamics of GOODS-N- 
774 (blue symbols) to the sizes, masses and stellar 
dynamics of galaxies in the SDSS (black) and 
massive quiescent galaxies at z~ 2 (red)*"". 
GOODS-N-774 has properties that are similar to 
previously studied massive quiescent galaxies at 
z~ 2 and is substantially offset from nearby 
galaxies. CO dynamics and CO sizes of two 
compact SMGs from ref. 13 (HDF 76 and N2850.2) 
are shown in grey. Error bars are s.d. 
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confirmed candidate: GOODS-S-5981° has a narrow linewidth, whereas 
COSMOS-8388 is difficult to interpret because it has an AGN. The num- 
ber density we infer is ~10°° Mpc’ ° (including all three candidates), 
compared with ~10°* Mpc’ ° for the overall population of galaxies with 
dense cores at z ~ 2 (ref. 3). 

This mismatch could imply that the lifetime of the compact star- 
forming phase is very short, as has been suggested previously on the basis 
of similar number density arguments*. It may be that we are witnessing 
the aftermath of the contraction ofa gravitationally unstable star-forming 
disk” or of a merger of large star-forming galaxies*. However, neither 
tidal features nor extended wings are apparent in the surface density 
distribution. 

It is perhaps more likely that the lifetime of the compact star-forming 
phase is relatively long and that we are missing many star-forming com- 
pact galaxies in present surveys. From the compact morphology and 
high star formation rate, we infer a high gas column density for this 
object”!: Nyy = 2.6 X 10°? cm *. This gas column density is nearly an 
order of magnitude higher than in an average ultraviolet-selected star- 
forming galaxy at the same cosmic epoch”, and 2.5 orders of magni- 
tude higher than in the disk of a typical galaxy in the local Universe’. 
This high column density of gas in conjunction with the abundance of 
metals implies” a very high extinction: Ay 2 100 mag for a screen (V, 
visual band), and Ay 2 6 mag if the dust and the stars are mixed. The 
amount of extinction is driven by the dust column density, not the dust 
mass, meaning that at fixed dust mass, a compact galaxy will be more 
obscured than a large galaxy. The detection of rest-frame optical flux, 
and of Ha emission, is inconsistent with such high values for extinction. 
The dust distribution is probably non-uniform, and it may be that, for 
GOODS-N-774, we are looking along a relatively unobscured line of sight. 

More typical star-forming cores could be entirely obscured”””’, and 
begin to resemble black bodies with temperatures of ~30 K (red curve 
in Fig. 3; calculated using a radius of 1 kpc and a bolometric luminosity 
of 10’? solar luminosities). It may be possible to select such obscured 
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progenitors at long wavelengths, near the peak of the redshifted dust 
emission. It has been demonstrated that redshifts, sizes and velocity 
widths of infrared-luminous galaxies can be measured from CO emission. 
In fact, the closest analogues to GOODS-N-774 are the two submillimetre- 
selected galaxies (SMGs) HDF 76 and N2850.2 (Fig. 4), which have high 
linewidths and small sizes in the CO line’. It will be interesting to deter- 
mine whether the stellar distributions of these galaxies are similar to the 
gas distribution, or whether these are dense star-forming regions inside 
larger galaxies. 

Longer-wavelength studies of large, unbiased samples can show whether 
GOODS-N-774 is an example of a parent population of compact star- 
forming galaxies that are heavily obscured*. There may also be mul- 
tiple paths to a compact, quiescent galaxy: some (such as HDF 76 and 
N2850.2*) may form most of their stars in mergers with star formation 
rates of 2 1,000Mo yr~', whereas others (such as GOODS-N-774) may 
grow relatively slowly in an obscured, accretion-throttled mode. What- 
ever the dominant mode turns out to be, because the stars in dense cores 
account for 10-20% of the total z ~ 2 stellar mass density’, star-forming 
cores should account for a significant fraction of all star formation in 
the high-redshift Universe. 

Very recently, evidence supporting our conclusions has been posted 
online”. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Spectral energy distribution. The candidate forming core was found using the 
3D-HST catalogues in the five CANDELS fields. CANDELS is a 902-orbit Hubble 
Space Telescope program that provides space-based optical and near-infrared imag- 
ing across ~900 arcmin’ (refs 30, 31). Aperture photometry was performed to pro- 
duce publicly available photometric catalogues and to derive stellar masses'*"*. Spitzer 
MIPS 24 1m fluxes were determined using the same methodology as in ref. 32. The 
derived fluxes are consistent with the public catalogue of ref. 33. Using the 24 jum 
data as position priors, we measure the 100-500 jum fluxes from the ultra-deep Herschel 
imaging in GOODS-North™. In sum, the rest-frame ultraviolet-optical data come 
from HST/ACS, HST/WFC3 and ground-based optical telescopes; the rest-frame 
near-infrared data are from Spitzer/IRAC; the mid-infrared point is from Spitzer/ 
MIPS; and the far-infrared data are from Herschel/PACS and SPIRE. 

Keck spectroscopy. We observed GOODS-N-774 with the near infrared spectro- 
graph (NIRSPEC) on the Keck I telescope in the K band, on 11 January 2014. The 
total integration time was 6,000 s. We used the low-dispersion mode with a slit width 
of 0.7", giving a spectral resolution of Ginstr = 6.1 A in the rest frame. We fitted a 
Gaussian to the Ho 2 = 6,563 A and [Nu] A = 6,548 and 6,584 A emission lines 
simultaneously and corrected for the instrumental resolution. The uncertainty in 
the derived properties was determined by refitting the model with empirical reali- 
zations of the noise. 

HST grism spectroscopy. A WFC3/G141 grism spectrum of the object was obtained 
as part of the 3D-HST survey. 3D-HST is a near-infrared slitless spectroscopic 
Treasury programme. We examined the grism spectrum after measuring a secure 
redshift from the Keck/NIRSPEC spectrum. The redshifted [O 1], HB and [O m1] 
lines are detected with a significance of 1.50-2.56. GOODS-N-774 has optical emis- 
sion line ratios [O m]/HB = 1.2 + 0.9 and [N 11]/Ha = 0.4 + 0.1, suggesting a level 
of gas excitation that is slightly higher than the locus of star-forming galaxies in the 
local Universe” but at the low end for star-forming galaxies at z ~ 2 (ref. 37) in the 
diagnostic BPT diagram. 

X-ray constraints. GOODS-N-774 is in the Chandra Deep Field North, which has 
been observed for a total of ~2 Ms with the Chandra X-ray satellite. The exposure 
time at the location of GOODS-N-774 is 1.22 Ms. The galaxy is not in the publicly 
available point-source catalogue of this field**. There are seven counts in a 3” aper- 
ture centred on the object location in the full-band (0.5-8 keV) X-ray image, fully 
consistent with the counts in random apertures in regions with the same exposure 
time. Using the s.d. of the counts in random apertures, we derive a2 s.d. upper limit 
of six counts for the X-ray flux of GOODS-N-774. Using PIMMS v4.6b, we derive a 
rest-frame 2-10 keV flux limit of Fy < 2.9 X 10° '’ ergs‘ cm’ ”, corresponding to 
a luminosity of Lx < 1.2 x 10” ergs '. We conclude that there is no evidence for 
an AGN in GOODS-N-774. The upper limit is consistent with the star formation 
rate of the galaxy”. 

Gas column density. We derive the gas surface density using the Kennicutt- 
Schmidt law”’: 


x 1.4+0.15 
Dser = (2.50.7) x 10-*( = ) Mo yr! kpe~? 


1 Mo pe~? 


Dynamical mass. We define dynamical mass as Mayn = k(n)o°r./G, with the con- 
stant k(n) depending on the Sérsic index: k(n) = 8.87 — 0.831n + 0.0241n? (ref. 40). 
GOODS-N-774 has a Sérsic index of n = 2.9; the comparison samples of compact 
quiescent galaxies at z ~ 2 (refs 1, 41-45) and SDSS galaxies with 0.058 < z < 0.060 
have median indices of n = 3.2 and n = 4.1, respectively. 
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A supermassive black hole in an ultra-compact 


dwarf galaxy 


Anil C. Seth!, Remco van den Bosch”, Steffen Mieske*, Holger Baumgardt’*, Mark den Brok’, J ay Strader°, Nadine Neumayer~®, 
Igor Chilingarian’*, Michael Hilker®, Richard McDermid?””, Lee Spitler?°, Jean Brodie", Matthias J. Frank!” & Jonelle L. Walsh’ 


Ultra-compact dwarf galaxies are among the densest stellar systems 
in the Universe. These systems have masses of up to 2 x 10° solar 
masses, but half-light radii of just 3-50 parsecs'. Dynamical mass esti- 
mates show that many such dwarfs are more massive than expected 
from their luminosity’. It remains unclear whether these high dynam- 
ical mass estimates arise because of the presence of supermassive black 
holes or result from a non-standard stellar initial mass function that 
causes the average stellar mass to be higher than expected**. Here we 
report adaptive optics kinematic data of the ultra-compact dwarf gal- 
axy M60-UCD1 that showa central velocity dispersion peak exceeding 
100 kilometres per second and modest rotation. Dynamical model- 
ling of these data reveals the presence of a supermassive black hole 
with a mass of 2.1 x 10’ solar masses. This is 15 per cent of the object’s 
total mass. The high black hole mass and mass fraction suggest that 
M60-UCD1 is the stripped nucleus of a galaxy. Our analysis also 
shows that M60-UCD1’s stellar mass is consistent with its luminosity, 
implying a large population of previously unrecognized supermas- 
sive black holes in other ultra-compact dwarf galaxies’. 

The object M60-UCD1 is the brightest ultracompact dwarf galaxy 
(UCD) currently known’, with a V-band luminosity Ly =4.1 x 10’ Lo 
(where Lo is the solar luminosity) and effective radius r, = 24 pc. It lies 
at a projected distance of 6.6 kpc from the centre of the massive ellip- 
tical galaxy M60 (Fig. 1), and 16.5 Mpc from us*. We obtained integral 
field spectroscopic data between 2 {1m and 2.4 jm of M60-UCD1 with 
the near-infrared integral field spectrograph on the Gemini North tele- 
scope. The high-spatial-resolution data obtained using laser guide-star 
adaptive optics provides a clear detection of the supermassive black hole. 
Modelling of the deep carbon dioxide (CO) absorption bandheads at 
2.3 tum enables us to measure the motions of stars at many different points 
across M60-UCD1. These kinematic measurements are shown in Fig. 2. 
Two features are particularly notable: (1) the dispersion is strongly peaked, 
with the central dispersion rising above 100 kms ' and dropping out- 
wards to ~50kms /, (2) rotation is clearly seen, with a peak amplitude 
of 40kms '. 

The stellar kinematics can be used to constrain the distribution of 
mass within M60-UCD1. This includes being able to test whether the 
mass traces light, or whether a supermassive black hole is required to 
explain the central velocity dispersion peak. We combined the stellar kine- 
matics and imaging from the Hubble Space Telescope with self-consistent 
Schwarzschild models’~* to constrain the black hole mass and the mass- 
to-light ratio (M/L, in solar units), shown in Fig. 3. We measure a black- 
hole mass of 2.1*§'5 x 10’ Mo andag-band M/L = 3.6 + 1 witherrors 
giving 1¢ confidence intervals (20 and 30 contours are shown in Fig. 3). 
The total stellar mass is 1.2 + 0.4 x 108 M, @> Where Mg is the solar mass. 


The best-fitting constant-M/L model with no black hole is ruled out 
with >99.99% confidence. 

M60-UCD1 is the lowest-mass system known to host a supermassive 
black hole (> 10° Mg), including systems with dynamical black hole 
estimates or with broad-line active galactic nuclei’®"’. There have been 
tentative detections of approximately 10* Mg black holes in lower- 
mass clusters'”’’, These detections remain controversial'*’* and the 
intermediate-mass black holes, if present, form a much smaller fraction 
of the total cluster mass than found in M60-UCD1. Of the 75 galaxies 
with reliable dynamical black hole mass measurements, only one other 
galaxy has a black hole mass fraction as high as that of M60-UCD1'°"*. 
A luminous and variable X-ray source was previously detected in M60- 
UCD 1 with a maximum luminosity of Lx = 1.3 x 10°8 ergs s (ref. 5). 
This luminosity suggests the black hole is accreting material at a rate 
typical of black holes in larger, more-massive early-type galaxies as well as 
other nearby galaxies with absorption-line-dominated optical spectra’””*. 
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Figure 1 | Hubble Space Telescope image of the M60-NGC 4647 system. 
M60-UCD1 is the nearly point-like image in the bottom right of a (boxed). The 
discovery of a black hole in M60-UCD1 provides evidence that it is the tidally 
stripped nucleus of a once larger galaxy. We note that NGC 4647 is at 
approximately the same distance from Earth as M60 but the two galaxies are not 
yet strongly interacting. b, A zoomed version of the g-band image of M60- 
UCD1 with contours showing the surface brightness in intervals of one 
magnitude per square arcsecond. The image is from NASA/ESA. 
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Figure 2 | Stellar kinematic maps of M60-UCD1 showing clear rotation and 
a dispersion peak. a and b show the measured radial velocities (bulk motions 
towards and away from us) and velocity dispersions (random motions) of 
the stars in M60-UCD1 with typical errors of 6kms '. Black contours show 
isophotes in the K-band stellar continuum. Kinematics are determined in each 


UCDsare thought to be either the most-massive globular star clusters’” 
or nuclei of larger galaxies that have been tidally stripped”**". The super- 
massive black hole that we have found at the centre of M60-UCD1 pro- 
vides strong evidence that it is a stripped nucleus of a once larger galaxy. 
While it is possible that dense star clusters can form black holes, these 
are expected to contain only a small fraction of the cluster’s mass”. Star 
clusters at the centre of galaxy nuclei, on the other hand, are known to 
host black holes with very high mass fractions”. Thus M60-UCD1 is the 
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Figure 3 | Dynamical modelling results show the presence of a supermassive 
black hole. The figure shows goodness-of-fit contours for the dynamical 
models of M60-UCD1 with two parameters, the g-band mass-to-light ratio and 
the black hole mass. The contours indicate 10 (white), 20, 3 (black, thick) 
confidence levels for two degrees of freedom. Green dots indicate discrete 
values of the mass-to-light ratio and the black hole mass at which models were 
fitted to the data. 
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individual pixel near the centre, but at larger radii the data were binned to 
increase the signal-to-noise ratio and enable kinematic measurements. c and 
d show the best-fitting dynamical model; a black hole is required to replicate the 
central dispersion peak. 


first individual UCD with explicit evidence for being a tidally stripped 
nucleus. 

We can estimate the properties of M60-UCD1’s progenitor galaxy, 
assuming that they follow scaling relations of present-day unstripped 
galaxies. Using the known scaling between black hole mass and bulge 
mass’°, we find a host bulge mass of 7*3 x 10° Mc. Bulge masses are 
also known to correlate with the masses of their nuclear star clusters”. 
In M60-UCD1, the surface brightness profile has two clear components’, 
and we identify the central component as the progenitor nuclear star 
cluster’ with mass 6.1 + 1.6 x 10’ Mo. This translates to a predicted 
bulge mass of 1.8 + 0.4 x 10!° Mg. Thus, two independent scaling rela- 
tions suggest that the progenitor bulge mass is about 10’? Mg. Given 
M60-UCDI’s cluster environment, its progenitor was probably a lower- 
mass elliptical galaxy that was then stripped by the massive elliptical gal- 
axy M60, which lies at a current projected distance of just 6.6 kpc. We 
have run simulations that show that it is feasible to produce M60-UCD1 
by stripping an elliptical galaxy progenitor of approximately 10!° Mo 
on a fairly radial orbit (see Supplementary Information and Extended 
Data Fig. 6). We note that current elliptical galaxies of approximately 
10° Mg have nuclear star cluster sizes, luminosities and colours con- 
sistent with the inner component of M60-UCD1”>”*. 

The detection of a supermassive black hole in M60-UCD1 may be 
just the tip of the iceberg of the UCD black hole population. Measure- 
ments of the integrated velocity dispersion in almost all UCDs with masses 
above 10’ Mg yield dynamical mass estimates that are too high to be 
accounted for bya normal stellar population without a massive central 
black hole’. Unlike these previous dynamical mass estimates, our dynam- 
ical modelling can separate out the gravitational influence of the black 
hole from the contribution of the stars. The models show that M60-UCD1’s 
stellar populations appear normal. The mass-to-light ratio of a stellar 
population characterizes the average mass of its stars. In M60-UCD1 
we measured a stellar dynamical mass-to-light ratio in the g band of 
MIL, = 3.6 = 1.0 in solar units (1o errors). This mass-to-light ratio is 
consistent with the stellar populations seen in lower-mass globular 
clusters” and models with a normal (Milky Way) initial mass function”. 
It is also lower than the integrated dynamical mass-to-light ratio estimates 
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in 18 of 19 UCDs above 10” Mg (ref. 2). The low stellar mass-to-light 
ratio in M60-UCD1 is inconsistent with proposed scenarios in which a 
density-dependent initial mass function yields higher than normal mass- 
to-light ratios*. Without such a mechanism, the simplest explanation for 
the high dynamical mass estimates in massive UCDs is that most host 
supermassive black holes just as M60-UCD1 does. 

There is also more limited evidence for enhanced dynamical mass- 
to-light ratios in lower-mass UCDs. About half of UCDs with masses 
between 3 x 10° Mo and 10’ Mg have higher inferred mass-to-light 
ratios than we see in M60-UCD1’. Taken in combination with tentative 
black hole detections of about 10* Mo in Local Group globular clusters’"* 
(which do not have increased M/L values owing to much lower black 
hole mass fractions), these observations suggest that some lower-mass 
UCDs may also host relatively massive black holes. 

Finally, we estimate what the total population of UCD black holes 
might be in the local Universe. The most complete sample of known 
UCDs is in the Fornax cluster. Comparing the UCD population to the 
population of galaxies in Fornax likely to host massive black holes, we 
find that UCDs may more than double the number of black holes (see 
Supplementary Information). Thus UCD black holes could represent a 
large increase in the massive black hole number density in the local Uni- 
verse. Future work can test this hypothesis. We have ongoing observing 
programmes to obtain similar observations to the ones presented here 
in four additional massive UCDs, and in the most massive star clusters 
in the Local Group. However, dynamical detection of black holes will be 
challenging in all but the brightest and nearest of objects, so accretion 
signatures” or tidal disruption events*’ may represent the best possi- 
bility for detecting black holes in less-massive UCDs. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Aridification of the Sahara desert caused by Tethys 
Sea shrinkage during the Late Miocene 


Zhongshi Zhang”, Gilles Ramstein?, Mathieu Schuster*, Camille Li>, Camille Contoux! & Qing Yan? 


It is widely believed that the Sahara desert is no more than ~2-3 mil- 
lion years (Myr) old’, with geological evidence showing a remarkable 
aridification of north Africa at the onset of the Quaternary ice ages”*. 
Before that time, north African aridity was mainly controlled by the 
African summer monsoon (ASM)*°’, which oscillated with Earth’s 
orbital precession cycles. Afterwards, the Northern Hemisphere gla- 
ciation added an ice volume forcing on the ASM, which additionally 
oscillated with glacial-interglacial cycles’. These findings led to the 
idea that the Sahara desert came into existence when the Northern 
Hemisphere glaciated ~2-3 Myr ago. The later discovery, however, 
of aeolian dune deposits ~7 Myr old’ suggested a much older age, 
although this interpretation is hotly challenged’ and there is no clear 
mechanism for aridification around this time. Here we use climate 
model simulations to identify the Tortonian stage (~7-11 Myr ago) 
of the Late Miocene epoch as the pivotal period for triggering north 
African aridity and creating the Sahara desert. Through a set of ex- 
periments with the Norwegian Earth System Model”’ and the Com- 
munity Atmosphere Model", we demonstrate that the African summer 
monsoon was drastically weakened by the Tethys Sea shrinkage during 
the Tortonian, allowing arid, desert conditions to expand across north 
Africa. Not only did the Tethys shrinkage alter the mean climate of 
the region, it also enhanced the sensitivity of the African monsoon 
to orbital forcing, which subsequently became the major driver of 
Sahara extent fluctuations. These important climatic changes prob- 
ably caused the shifts in Asian and African flora and fauna observed 
during the same period*'*“, with possible links to the emergence 
of early hominins in north Africa’>’*. 

The Sahara desert, the largest non-polar desert in the world today 
(~9,400,000 km’), is widely believed to have appeared during the past 
2 or 3 Myr (ref. 1). A wealth of terrestrial and marine evidence indicates 
aremarkable aridification across north Africa since the onset of the Qua- 
ternary ice ages” *. The arid conditions have not been constant. North 
African aridity varies with the strength of the ASM, which is influenced 
by the ~20-kyr precession cycle*** as well as by global ice volume fluc- 
tuations that follow ~40-kyr or ~100-kyr cycles*®. When orbital preces- 
sion maximizes Northern Hemisphere summer insolation, or when the 
ice volume is small, a strong ASM brings more moisture into north 
Africa from the tropical Atlantic>*. The increased precipitation limits 
the extent of the desert, even replacing it at times with a ‘green Sahara’ 
landscape of vegetation, rivers and lakes that favours occupation by fauna 
and humans. The relationship between north African aridity, the ASM 
and glacial—interglacial cycles has been used as further support for the 
notion that the Sahara desert first appeared at the same time as the ice 
ages started (~2-3 Myr ago). 

Recently, geological evidence for an earlier appearance of the Sahara 
desert has been mounting. Dust flux” and terrestrial vegetation archives* 
from marine sediments indicate the existence of arid periods in north 
Africa since ~8 Myr ago. On land, aeolian dune deposits in the north- 
ern Chad basin suggest that desert conditions existed as early as 7 Myr 


ago’. In the same area, several large palaeolake recurrences recorded 
between 3 and 7 Myr ago also reveal arid~humid cycles’”. This evidence 
for the early onset of Saharan aridity is, however, hotly challenged’, and 
although the appearance of the Sahara desert 2-3 Myr ago can be tied 
to the onset of the Quaternary ice ages, there are no candidate explana- 
tions for its earlier appearance. 

To investigate possible explanations for the onset of Saharan aridity, 
we simulate climate change in north Africa on geological timescales over 
the past 30 Myr. First, we simulate the climate of the Late Oligocene, the 
Early Miocene, the Late Miocene and today, using the coupled low- 
resolution version of the Norwegian Earth System Model (NorESM- 
L)’*. Then, with a high-resolution version of the atmospheric component 
of NorESM-L, the Community Atmosphere Model version 4 (CAM4)", 
we perform sensitivity experiments to identify the dominant drivers for 
the simulated climate changes in north Africa. (See Methods and Ex- 
tended Data for a detailed description of the boundary conditions, exper- 
imental design and a discussion of possible model bias and uncertainties.) 

In the coupled NorESM-L simulations, north Africa experiences a 
pronounced aridification from the Early Miocene to the Late Miocene 
(Fig. 1). In the Late Oligocene and the Early Miocene experiments, north 
Africa is dominated by a semiarid steppe climate (according to the K6ppen 
climate classification) with only restricted areas of arid desert climate 
(Fig. 1a, b). In the Late Miocene experiment, the arid desert climate 
expands across much of north Africa (Fig. 1c), with a greater resemb- 
lance to today’s conditions (Fig. 1d). The aridification is due to a reduc- 
tion in north African precipitation (Fig. 2) from annual values (regionally 
averaged) ~400 mm in the Late Oligocene and the Early Miocene ex- 
periments to <200 mm (a criterion for classifying arid climates) in the 
Late Miocene experiments (Fig. 2e, f). 

The simulated aridification is not caused by changes in vegetation, 
orbital parameters or atmospheric concentrations of greenhouse gas. Veg- 
etation and orbital changes can influence precipitation in north Africa’’, 
but these are identical in all the coupled palaeoclimate experiments and 
hence cannot be responsible for the simulated aridification. Another 
possible cause is the “greenhouse effect’ associated with atmospheric 
CO), whose concentrations decrease from the Oligocene to the Miocene. 
However, when the experiments are repeated with the atmospheric 
CO, concentrations held constant, north Africa experiences reduced 
precipitation (Fig. 2c, d) and aridification (Fig. 2f) in the Late Miocene 
experiment. 

The remaining possibility for explaining the simulated aridification 
is tectonic changes, including changes in orography, bathymetry and, 
most importantly, land-sea distribution. Orographic changes, mainly 
mountain uplift, can be excluded. Previous studies have shown that the 
major orogeneses occurring during the Neogene (the past 23 Myr)—that 
is, the uplift of eastern and southern Africa’ and the Tibetan Plateau*””*— 
do not have a significant impact on north African climate” **. Changes 
in bathymetry that influence regional climate are often linked to changes 
in land-sea distribution. Between the Early Miocene and the Late Miocene, 


1Bjerknes Centre for Climate Research, Uni Research Climate, Allégaten 70, 5007 Bergen, Norway. *Nansen-Zhu International Research Centre, Institute of Atmospheric Physics, Chinese Academy of 
Sciences, 100029 Beijing, China. 3Laboratoire des Sciences du Climat et de l’Environnement/IPSL, CEA-CNRS-UVSQ, UMR8212, Orme des Merisiers, CE Saclay, 91191 Gif-sur-Yvette Cedex, France. 
4Institut de Physique du Globe de Strasbourg (UMR 7516), Ecole et Observatoire des Sciences de la Terre, CNRS and Université de Strasbourg, 1 rue Blessig, 67084 Strasbourg Cedex, France. °Bjerknes 
Centre for Climate Research, Geophysical Institute, University of Bergen, Allégaten 70, 5007 Bergen, Norway. 
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Figure 1 | Tethys palaeogeography and 
schematic north African palaeoclimate. 

Late Oligocene (a), Early Miocene (b), Late 
Miocene (c) and modern (d) time slices. The 
reconstructed Tethys bathymetry is based on the 
palaeogeography map created in refs 26 and 27, 
showing the West Tethys Sea (WT), Paratethys Sea 
(PT), proto-Mediterranean Sea (PM) and Arabian 
peninsula (AP). Hatched areas show semiarid 
steppe climate (orange) and arid desert climate 
(red) from simulations, according to the Képpen 
climate classification. The black lines show the 
simulated climatological mean Intertropical 
Convergence Zone (ITCZ) in summer (June to 
August), which is a measure of the intensity of 
the ASM. 


Figure 2 | North African precipitation from 
coupled experiments. Annual precipitation (mm, 
contours), sea surface temperature (°C, shading) 
and currents (cms ', arrows) simulated in the 
experiments with varying CO) (a, b) and at 
fixed CO, (c, d). a, Early Miocene with CO, at 
420 p.p.m. by volume (p.p.m.v.); b, Late Miocene 
with 350 p.p.m.v. CO); ¢, Early Miocene with 
560 p.p.m.v. CO); d, Late Miocene with 

560 p.p.m.v. CO2. The black dots show the 
positions of climatological mean ITCZ calculated 
according to ref. 6. EQ, Equator. e, f, Annual 
precipitation (mm) averaged over north Africa 
(between 25° N and 35° N, 12° W and 38° E; 
black boxes in a-d) in the Late Oligocene (LO), 
Early Miocene (EM), Late Miocene (LM) and 
modern experiments (Mod.) with the indicated 
atmospheric CO; levels. 
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Figure 3 | Changing moisture transport and 
precipitation due to Tethys shrinkage. 

a, b, Column-integrated moisture transport with a 
large Tethys, representing the Early Miocene 

(a), and modern land-sea distribution, 
representing the Late Miocene (b). The contour 
lines show magnitude (>100kgm™'s~* 
displayed) and the arrows show the direction 
(>150kgm 's7' displayed). c, d, Changes in 
annual precipitation (mm, contours) (c) and 
summer 850-hPa winds (ms ', arrows) (d) due to 


Tethys shrinkage. Only changes that are significant 
at the 95% confidence level (two-tailed unequal 
t-test) are shown. The red (a large Tethys) and 
blue (modern land-sea distribution) dots show the 
positions of climatological mean ITCZ*. 
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the most notable event affecting land-sea distribution was the shrink- 
age of the Tethys Sea**’’, a body of water comprising the proto- 
Mediterranean, West Tethys and Paratethys seas that is the origin of 
the modern Mediterranean, Black and Caspian seas (Fig. 1). In the Late 
Oligocene and Early Miocene, the Tethys was large, with the West Tethys 
seaway connecting the proto-Mediterranean Sea to the Indian Ocean 
(Fig. 1a, b)”*-?”. The Afro-Arabian and Eurasian plates collided during 
the Middle Miocene, closing the West Tethys seaway”. Finally, during 
the Tortonian stage (~7-11 Myr ago) of the Late Miocene, the West Tethys 
was replaced by the Arabian peninsula, and the Paratethys became much 
smaller (Fig. 1c)”’. 

The shrinkage of the Tethys drastically weakens the ASM and causes 
a reorganization of atmospheric moisture transport leading to a signif- 
icant reduction in north African precipitation, as shown in the high- 
resolution CAM4 simulations (Fig. 3). When the Tethys is large, large 
heat fluxes from the ocean surface input energy to the atmospheric col- 
umn above, which exports the energy via a thermally direct circulation. 
There is a continuous transport path bringing moisture from the trop- 
ical Atlantic across all of north Africa in the ASM (Fig. 3a). Replacing 
the Tethys with land leads toa substantial decrease in surface heat fluxes 
as well as in the net energy in the atmospheric column (Extended Data 
Fig. 7). This energy deficit causes anomalous low-level atmospheric 
divergence'* that weakens the ASM and is accompanied bya southward 
retreat of the climatological mean Intertropical Convergence Zone in 
eastern north Africa in both the atmosphere-only experiment (Fig. 3d) 
and the coupled experiment (Fig. 2). As a result, less moisture is trans- 
ported from the tropical Atlantic and precipitation is reduced in north 
Africa. 

In addition to causing a change in mean climate, the Tethys shrinkage 
also increases north African climate sensitivity to orbital forcing (Fig. 4 
and Extended Data Figs 8 and 9). As described above, when Northern 
Hemisphere summer insolation is stronger (such as at 6 kyr ago), the 
amplified land-sea contrast sharpens the pressure gradient between 
the north African continent and the tropical Atlantic and strengthens 
the ASM. However, a given insolation change causes a weaker atmo- 
spheric response (near-surface winds and precipitation shown in Fig. 4) 
with a large Tethys than with a modern land-sea distribution. These 
experiments thus predict a change in how the precession cycle drives 
climatic and hydrological variations depending on the land-sea distri- 
bution of the region. 

Of the various stages in the shrinking of the Tethys Sea, the replace- 
ment of the West Tethys by the Arabian peninsula during the Tortonian 
stage (~7-11 Myr ago) of the Late Miocene seems most important for 


60°E 80°E 

the onset of Saharan aridity (see Methods and Extended Data Fig. 4). 
Before the Tortonian, north Africa had a semiarid climate (Fig. 1a, b) 
and was not very sensitive to orbital changes (Fig. 4a), implying that the 
region was generally covered with vegetation and lakes, much as it was 
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Figure 4 | Climate response to increased summer insolation in the Northern 
Hemisphere before and after Tethys shrinkage. Response in annual 
precipitation (mm, shading) and 850-hPa summer winds (m s_ |, arrows) toa 
change in orbital parameters from today to 6 kyr ago (higher minus lower 
Northern Hemisphere summer insolation) for continental configuration with a 
large Tethys (a) and a modern land-sea distribution (b). Only changes that are 
significant at the 95% confidence level (two-tailed unequal t-test) are shown. 
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during the Quaternary African Humid Periods”*. After the Tortonian, 
north Africa became more arid (Fig. 1c, d) on average and desert condi- 
tions prevailed, but with periodic fluctuations to a green Sahara in re- 
sponse to orbital forcing, as expected from our experiments (Fig. 4b). The 
arid—humid fluctuations are evident from the Tortonian to the present*”’, 
paced by the ~20-kyr precession cycles during the Late Miocene and 
Pliocene’, over which glacial—interglacial cycles are superimposed dur- 
ing the Quaternary’. 

Previous studies have linked the Tethys shrinkage with climate changes 
further east, in particular those associated with an enhancement of the 
South Asian summer monsoon (SASM)”*. Geological evidence shows 
increased upwelling in the Indian Ocean at ~7-8 Myr ago”, an indi- 
cator for wind changes due toa stronger SASM. Additionally, modelling 
experiments show that central Eurasian summer temperatures increase 
in response to the Tethys shrinkage, which would also enhance the mon- 
soon circulation”®. Our study further supports this idea. When the West 
Tethys Sea is replaced by the Arabian peninsula, the resulting changes 
in the atmospheric energy balance produce a near-surface cyclonic anom- 
aly that strengthens the western branch of the SASM (the Somali jet; 
Fig. 3). 

Here we have identified the pivotal role of the shrinkage of the Tethys 
Sea during the Tortonian stage for north African aridification. Model- 
ling experiments show that the altered land-sea distribution affects the 
region’s monsoon circulations and enhances sensitivity to orbital for- 
cing, permitting the appearance of oscillating arid (desert) and humid 
(green Sahara) periods paced by precession. These results indicate that 
the Tortonian stage was critical in the shift from a permanently vege- 
tated Sahara to a desert exhibiting arid—humid oscillations at orbital time- 
scales. This climate shift could be responsible for the pronounced shifts 
in flora and fauna that occurred during the Late Miocene, for example 
the replacement of C3-photosynthesizing plants by C, plants*”, the de- 
mise of Tethyan laurel forests’* and the split of the endemic African 
mammalian order Macroscelidea into two species”, all of which have 
been ascribed to the widespread aridification of north Africa and Asia. 
Moreover, the marked change to modern-like climate conditions in the 
region is likely to have affected the emergence of early hominins in Africa, 
notably the Sahara’’. The heightened sensitivity to orbital forcing after 
the Tortonian has intriguing links to mounting evidence that changes 
in precipitation due to precession were fundamental to the evolution 
and dispersal of hominins in north Africa’®. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Spreading continents kick-started plate tectonics 


Patrice F. Rey', Nicolas Coltice** & Nicolas Flament' 


Stresses acting on cold, thick and negatively buoyant oceanic litho- 
sphere are thought to be crucial to the initiation of subduction and 
the operation of plate tectonics’, which characterizes the present- 
day geodynamics of the Earth. Because the Earth’s interior was hotter 
in the Archaean eon, the oceanic crust may have been thicker, thereby 
making the oceanic lithosphere more buoyant than at present’, and 
whether subduction and plate tectonics occurred during this time is 
ambiguous, both in the geological record and in geodynamic models’. 
Here we show that because the oceanic crust was thick and buoyant’, 
early continents may have produced intra-lithospheric gravitational 
stresses large enough to drive their gravitational spreading, to initi- 
ate subduction at their margins and to trigger episodes of subduc- 
tion. Our model predicts the co-occurrence of deep to progressively 
shallower mafic volcanics and arc magmatism within continents in 
a self-consistent geodynamic framework, explaining the enigmatic 
multimodal volcanism and tectonic record of Archaean cratons®. More- 
over, our model predicts a petrological stratification and tectonic struc- 
ture of the sub-continental lithospheric mantle, two predictions that 
are consistent with xenolith’ and seismic studies, respectively, and con- 
sistent with the existence of a mid-lithospheric seismic discontinuity’. 
The slow gravitational collapse of early continents could have kick- 
started transient episodes of plate tectonics until, as the Earth’s inte- 
rior cooled and oceanic lithosphere became heavier, plate tectonics 
became self-sustaining. 

Present-day plate tectonics is primarily driven by the negative buoy- 
ancy of cold subducting plates. Petrological and geochemical proxies of 
subduction preserved in early continents point to subduction-like pro- 
cesses already operating before 3 billion years (Gyr) ago*” and perhaps 
as early as 4.1 Gyr ago’’. However, they are not unequivocal, and geo- 
dynamic modelling suggests that the thicker basaltic crust produced by 
partial melting of a hotter Archaean or Hadean mantle would have had 
increased lithospheric buoyancy and inhibited subduction**. Mantle con- 
vection under a stagnant lid with extensive volcanism could therefore 
have preceded the onset of subduction”’. In this scenario, it is classically 
assumed that the transition from stagnant-lid regime to mobile-lid regime 
and the onset of plate tectonics require that convective stresses overcame 
the strength of the stagnant lid’? at some stage in the Archaean. 

On the modern Earth, gravitational stresses due to continental buoyancy 
can contribute to the initiation of subduction’. The role of continental 
gravitational stresses as a driver of Archaean lithospheric deformation 
has been emphasized'*”°; however, their potential to initiate subduction 
has been overlooked. Studies of xenoliths from Archaean cratons show 
that the early continental crust was underlain by a thick (~200 km) lith- 
ospheric mantle, moderately to strongly depleted and therefore buoyant”. 
A common model for the formation of early continental lithosphere 
invokes partial melting in mantle plumes, leading to magnesium-rich 
mantle residues (for example, refractory harzburgites and dunites) under 
thick basaltic plateaux*’*””. Partial melting of these thick basaltic crusts, at 
depths >40 km, further differentiates the crust into tonalite-trondjemite- 
granodiorite (TTG) and restitic material'*’*. 

First-order calculations show that the horizontal gravitational force 
acting between a continent 200 km thick and adjacent oceanic lithosphere 
is of the order of 10'* Nm _! (see Extended Data Fig. 1), comparable to 


that of present-day tectonic forces driving orogenesis'. To explore the 
tectonic impact of a thick and buoyant continent surrounded by a stag- 
nant lithospheric lid, we produced a series of two-dimensional thermo- 
mechanical numerical models of the top 700 km of the Earth, using 
temperature-dependent densities and visco-plastic rheologies that depend 
on temperature, melt fraction and depletion, stress and strain rate (see 
Methods). The initial temperature field is the horizontally averaged tem- 
perature profile of a stagnant-lid convection calculation for a mantle 
~200 K hotter than at present (Fig. 1A, aand Extended Data Fig. 2). The 
absence of lateral temperature gradients ensures that no convective stresses 
act on the lid, allowing us to isolate the dynamic effects of the continent. 
A buoyant and stiff continent 225 km thick (strongly depleted mantle 
root 170 km thick overlain by felsic crust 40 km thick; see Fig. 1B, a) is 
inserted within the lid, on the left side of the domain to exploit the sym- 
metry of the problem (Fig. 1A, a). A mafic crust 15 km thick covers the 
whole system (Fig. 1A, a), consistent with the common occurrence of 
thick greenstone covers on continents, as well as thick basaltic crust on 
the oceanic lid’. 

Our numerical solutions show that the presence of a buoyant con- 
tinent imparts a horizontal force large enough to induce a long period 
(~50-150 Myr) of slow collapse of the whole continental lithosphere 
(Fig. 1 and Extended Data Fig. 3), in agreement with the dynamics of 
spreading for gravity currents’’. Hence, a continent of larger volume leads 
to larger gravitational power and faster collapse. Because of lateral spread- 
ing of the continent, the adjacent lithospheric lid is slowly pushed under 
its margin (Fig. 1A, b and Extended Data Fig. 3A, a). For gravitational 
stress lower than the yield stress of the oceanic lid, thickening of the margin 
of the lid is slow, and viscous drips (that is, Rayleigh-Taylor instabilities) 
detach from its base (Extended Data Fig. 3A, a and b). These instabilities, 
typical of stagnant-lid convection”, mitigate the thermal thickening of 
the lid. 

When gravitational stresses overcome the yield stress of the lithospheric 
lid, subduction is initiated (Fig. 1A, b and c). Depending on the half-width 
of the continent and its density contrast with the adjacent oceanic lid (that 
is, its gravitational power) three situations can arise: first, subduction 
initiates and stalls (Extended Data Fig. 3b); second, the slab detaches and 
the lid stabilizes (Fig. 1A, d and e and Extended Data Fig. 3c); or third, 
recurrent detachment of the slab continues until recycling of the oceanic 
lid is completed, followed by stabilization (Extended Data Fig. 3d). When 
the slab reaches a depth of ~200 km, slab pull contributes to drive sub- 
duction and rapid rollback of the subducting lid, which in turn promotes 
lithospheric boudinage and continental rifting (Fig. 1A, c and Extended 
Data Fig. 3C, b-d). Through spreading and thinning of the continent, 
its base rises from 225 km to ~75 km deep on average, and shallower 
between lithospheric boudins (Figs 1A, d and 2b and Extended Data 
Fig. 3C, cand dand D, b-f). This triggers an episode of deep (~ 150 km) 
to shallow (<100km) decompression melting and progressive deple- 
tion of the ambient fertile mantle (Figs 2 and 3b). Harzburgites of the 
continental mantle are too refractory to melt on decompression. Poly- 
baric melting of fertile mantle produces a basalt cover 6 km thick anda 
mantle residue ~75 km thick with an average depletion of 7.5% (Figs 2 
and 3c). The bulk of depletion occurs during a volcanic episode lasting up 
to ~13 Myr, although partial melting persists for up to 45 Myr (Fig. 2b). 
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Continent 


Figure 1 | Numerical solution of an example of continent collapse leading to 
subduction. A, a, Modelling setup (0 Myr). b-e, Computed snapshots for a box 
700 km deep and 6,300 km long including a continent 225 km thick with a 
half-width of 800 km. b, 46.7 Myr; c, 55.3 Myr; d, 57.2 Myr; e, 123.8 Myr. 

All mantle rocks have a limiting yield stress of 300 MPa. Mantle cooler than 
1,620 K is in blue (darker blue is hotter); mantle hotter than 1,620 K is in pink 
(darker pink is hotter). Regions of depletion due to partial melting of ambient 
fertile mantle are hatched. B, Compositional structure, reference densities 
and reference rheological profile for the continent (a) and for the adjacent 
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lithospheric lid (b). This numerical solution documents the long phase of 
slow continental spreading leading to the initiation of a slab (A, b and c). 
Once the slab has reached a depth of ~200 km, slab pull contributes to drive 
subduction, rollback and continental boudinage (A, c) (in some experiments 
boudinage leads to rifting) and slab detachment (A, d). In this experiment the 
detachment of the slab is followed by a long period of thermal relaxation and 
stabilization during which the thickness of the continent increases through 
cooling and incorporation of the moderately depleted mantle (A, e). 
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Figure 2 | Development of layering of the continental lithosphere through 
thinning and progressive accretion of moderately depleted mantle. a, Points 
located at depths Ao to A; before spreading are exhumed during spreading 
to locations A’ to A;’, following the blue pressure-temperature-time path. 
The geotherm intersects the solidus, and the temperature in the partially molten 
column remains close to the solidus because latent heat is continuously 
extracted with the melt once melt fraction has reached 1%. Melt is extracted 
from various depths following the melt adiabat (yellow paths). The region 
between the solidus and liquidus of the hydrous fertile mantle is mapped for 
MgO content (see Methods). The deeper part of the column produces 
komatiitic basalts (dark blue shading), whereas partial melting at pressures 
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<3 GPa produces tholeiitic basalts (pale blue shading). b, Temporal evolution 
of the laterally averaged depletion (blue), partial melting (yellow) and density 
interfaces (thick dashed lines). As spreading and thinning proceed, pure 
shear fabrics (shown as finite strain ellipses) develop in the refractory mantle 
and in the moderately depleted mantle, which records a shorter strain history. 
The base of the partially molten column remains close to ~150 km, whereas 
its top progressively rises from ~150 km at the beginning of partial melting 
(at ~44 Myr) to an average of ~75 km below the surface (at ~55 Myr). From 
55 Myr, spreading slows down and progressive cooling reduces the amount of 
melt, until partial melting stops at ~82 Myr. This results in the progressive 
chemical and structural stratification of the lithospheric mantle. 
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Figure 3 | Proposed model for the co-evolution of cratonic crust and 
sub-continental lithospheric mantle. Integration of the results of our 
numerical experiments with petrological data supports a model linking the 
formation of continents to the initiation of subduction at their margins, 
through the process of continent collapse. This model predicts the layering of 
the sub-continental lithospheric mantle (SCLM), polybaric and multimodal 
volcanism recorded in greenstone cover, and the metasomatism of the SCLM. 
a, Partial melting in the deep mantle leads to the formation of an oceanic 
plateau that differentiates into a continent. The residue of mantle melting forms 
the strongly depleted harzburgite root of the continent, whereas the deeper 
part of the basaltic crust differentiates by partial melting into TTG. b, As the 
continent grows in thickness and length, excess gravitational potential energy 
drives its collapse and the shortening of the adjacent oceanic lid (in grey). 
During collapse and thinning of the continent, decompression melting of the 


In the last ~25 Myr of melting, the sub-continental mantle cools until 
melting stops (Figs 1A, e and 2). Decompression melting allows the spread- 
ing continent to maintain a minimum chemical thickness of at least 140- 
150 km. After the melting phase, conductive cooling results in the thermal 
thickening and strengthening of a chemically stratified cratonic litho- 
sphere (Figs 1A, e, 2 and 3d). Over the whole process, the buoyancy of 
the continent decreases, subduction stops and a stagnant lid regime is 
re-established (Fig. 1A, e). 

Trade-offs between yield stress, gravitational stress and continental 
volume determine the initiation of subduction in our models (Extended 
Data Fig. 3b, d). Fora yield stress of 150-300 MPa, consistent with recent 
estimates of rheological parameters of the lithospheric mantle”, increas- 
ing the width of the continent favours the initiation of subduction (Ex- 
tended Data Fig. 4). These results suggest an increasing potential for 
subduction as continental area increased over time. 

Not only do our models confirm important results from previous stud- 
ies, but they also provide innovative explanations for key attributes of 
Archaean cratons. Both transient subduction and dripping styles are 
consistent with previous models of Archaean”’ and modern’ subduc- 
tion. As observed in previous work, the length of the detached segments 
increases with the yield stress of the lid’’. Our models confirm that a com- 
bination of the buoyancy and high viscosity of Archaean sub-continental 
lithospheric mantle, and large plastic strain weakening, prevent the re- 
cycling of the sub-continental lithospheric mantle, which explains its 
longevity**. Our models account for the average thickness (~6 km) and 


1 Metasomatized SCLM { 


fertile ambient mantle and extraction of deep (komatiitic basalt) to shallow 
(tholeiite) melts (yellow) contribute to the growth of the greenstone cover, and 
to the formation of a moderately depleted mantle layer. c, As a result of the 
horizontal push of the collapsing continent, the thickened margin of the 
oceanic lid subducts underneath the continental margin. Partial melting of the 
thick, eclogitized oceanic crust produces TTG melts (purple), which 
metasomatise both the mantle wedge and the lithospheric mantle. Melting of 
the hydrated and metasomatized mantle wedge produces calc-alkaline to 
sanukitoid melts (orange). d, After detachment of the slab, and once the 
gravitational power of the continent is too small to deform its surrounding, the 
continent thickens through thermal relaxation and cooling, first incorporating 
the layer of moderately depleted mantle and then a layer of unmelted fertile 
mantle. 


duration of volcanism (~ 10-50 Myr) of greenstone covers, and predict 
polybaric (5-2 GPa) partial melting, involving deep (garnet-bearing) to 
shallow sources (Fig. 2). The MgO content of basaltic melts produced at 
these pressures ranges between 11% and 17% (Fig. 2a), consistent with 
the komatiitic basalts (12-18% MgO) typical of Archaean greenstones’””». 
In our model, tholeiitic basalts (6-12% MgO), abundant in Archean 
greenstones”, can be produced in regions of continental necking where 
partial melting can occur at pressures <3 GPa. Figure 2a shows that to 
account for the formation of komatiites (MgO > 18%) our model would 
simply require a mantle potential temperature greater than 1,820 K. When 
subduction starts, arc volcanism is expected at convergent margins while 
continental extension and rifting still operate (Fig. 1A, c). Our model can 
therefore explain the metasomatism and production of sanukitoid melts 
through the migration of younger TTG melts generated by the partial 
melting of subducting eclogitized basaltic crust (Fig. 3c). 

On modern Earth, mantle plumes mostly occur away from subduction 
zones”*. Hence, the eruption duration and sequential interlayering of koma- 
tiite, tholeiite, calc-alkaline and felsic volcanics, ubiquitous in Archaean 
greenstone belts*”””*, frequently attributed to repeated interaction between 
mantle plumes and subduction zones over hundreds of millions of years’, 
remains enigmatic. However, our model predicts this co-occurrence of 
deep (up to 150 km) to shallow (<100 km) mafic volcanics (Fig. 2b) and 
arc magmatism, in a self-consistent geodynamic framework. 

Moreover, our model predicts a progressive chemical stratification 
of the sub-continental lithospheric mantle concomitant with that of the 
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continental crust and growth of the greenstone cover. This is consistent 
with the strong geochemical layering of cratonic mantle inferred by 
geochemical and petrological studies of mantle xenoliths. Pure shear 
stretching and thinning during the collapse promotes development of 
a subhorizontal litho-tectonic fabric in the refractory harzburgite and 
dunite, and to a smaller extent in the accreted moderately depleted man- 
tle (Fig. 2b). The predicted litho-tectonic layering can explain the seismic 
mid-lithospheric discontinuity at about 100 km depth observed within 
cratons”. This discontinuity’ could correspond to the sharp transition 
predicted by our model between the strongly stretched, strongly depleted 
primary root of the continent and the moderately stretched, moderately 
depleted-to-fertile mantle accreted through cooling (Fig. 3). 

We propose that the collapse of early continents was a key process in 
Archaean geodynamics, resulting in the concomitant structuration of 
the mantle root and the crust of cratons. This process would have kick- 
started transient episodes of plate tectonics, until plate tectonics became 
self-sustaining through the increasing continental area” and the decreas- 
ing buoyancy of oceanic plates’. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Ancient human genomes suggest three ancestral 
populations for present-day Europeans 


A list of authors and their affiliations appears at the end of the paper 


We sequenced the genomes of a ~7,000-year-old farmer from Ger- 
many and eight ~8,000-year-old hunter-gatherers from Luxembourg 
and Sweden. We analysed these and other ancient genomes’ * with 
2,345 contemporary humans to show that most present-day Europeans 
derive from at least three highly differentiated populations: west Euro- 
pean hunter-gatherers, who contributed ancestry to all Europeans but 
not to Near Easterners; ancient north Eurasians related to Upper Pal- 
aeolithic Siberians*, who contributed to both Europeans and Near 
Easterners; and early European farmers, who were mainly of Near 
Eastern origin but also harboured west European hunter-gatherer 
related ancestry. We model these populations’ deep relationships and 
show that early European farmers had ~44% ancestry from a ‘basal 
Eurasian’ population that split before the diversification of other 
non-African lineages. 

Near Eastern migrants from Anatolia and the Levant are known to 
have played a major role in the introduction of agriculture to Europe, as 
ancient DNA indicates that early European farmers were distinct from 
European hunter-gatherers** and close to present-day Near Easterners*®. 
However, modelling present-day Europeans as a mixture of these two 
ancestral populations* does not account for the fact that Europeans are 
also admixed with a population related to Native Americans’®. To clar- 
ify the prehistory of Europe, we sequenced nine ancient genomes (Fig. 1 
and Extended Data Fig. 1): ‘Stuttgart’ (19-fold coverage), a ~7,000-year- 
old skeleton found in Germany in the context of artefacts from the first 
widespread farming culture of central Europe, the Linearbandkeramik; 
‘Loschbour’ (22-fold), an ~8,000-year-old skeleton from the Loschbour 
rock shelter in Luxembourg, discovered in the context of hunter-gatherer 
artefacts (Supplementary Information sections 1 and 2); and seven ~8,000- 
year-old samples (0.01-2.4-fold) from a hunter-gatherer burial in Motala, 
Sweden (the highest coverage individual was “Motala12’). 

Sequence reads from all samples revealed >20% C—T and G>A 
deamination-derived mismatches at the ends of the molecules that are 
characteristic of ancient DNA*”® (Supplementary Information section 3). 
We estimate nuclear contamination rates to be 0.3% for Stuttgart and 
0.4% for Loschbour (Supplementary Information section 3), and mito- 
chondrial (mtDNA) contamination rates to be 0.3% for Stuttgart, 0.4% 
for Loschbour, and 0.01-5% for the Motala individuals (Supplementary 
Information section 3). Stuttgart has mtDNA haplogroup T2, typical of 
Neolithic Europeans", and Loschbour and all Motala individuals have 
the U5 or U2 haplogroups, typical of hunter-gatherers*° (Supplementary 
Information section 4). Stuttgart is female, whereas Loschbour and five 
Motala individuals are male (Supplementary Information section 5) and 
belong to Y-chromosome haplogroup I, suggesting that this was common 
in pre-agricultural Europeans (Supplementary Information section 5). 

Wecarried out large-scale sequencing of libraries prepared with ura- 
cil DNA glycosylase (UDG), which removes deaminated cytosines, thus 
reducing errors arising from ancient DNA damage (Supplementary Infor- 
mation section 3). The ancient individuals had indistinguishable levels 
of Neanderthal ancestry when compared to each other (~2%) and to 
present-day Eurasians (Supplementary Information section 6). The het- 
erozygosity of Stuttgart (0.00074) is at the high end of present-day Euro- 
peans, whereas that of Loschbour (0.00048) is lower than in any present 
human populations (Supplementary Information section 2); this must 


reflect a strong bottleneck in Loschbour’s ancestors, as the genetic data 
show that he was not recently inbred (Extended Data Fig. 2). High copy 
numbers for the salivary amylase gene (AMY1) have been associated 
with a high starch diet'’; our ancient genomes are consistent with the 
direction of this observation in that the Stuttgart farmer had the high- 
est number of copies (16), whereas the ancient hunter-gatherers La 
Brana (from Iberia)’, Motalal2, and Loschbour had lower numbers (5, 6 
and 13, respectively) (Supplementary Information section 7). We caution, 
however, that copy count in Loschbour is at the high end of present-day 
humans, showing that high copy counts of AMY1 cannot be accounted 
for entirely by selection since the switch to agriculture. Both Loschbour 
and Stuttgart had dark hair (>99% probability); and Loschbour, like 
La Brafa and Motala12, probably had blue or light coloured eyes (>75%) 
whereas Stuttgart probably had brown eyes (>99% probability) (Sup- 
plementary Information section 8). Neither Loschbour nor La Brafia 
carries the skin-lightening allele in SLC24A5 that is homozygous in 
Stuttgart and nearly fixed in Europeans today’, but Motala12 carries at 
least one copy of the derived allele, showing that this allele was present 
in Europe before the advent of agriculture. 

We compared the ancient genomes to 2,345 present-day humans from 
203 populations genotyped at 594,924 autosomal single nucleotide poly- 
morphisms (SNPs) with the Human Origins array* (Supplementary Infor- 
mation section 9) (Extended Data Table 1). We used ADMIXTURE” 
to identify 59 ‘west Eurasian’ populations that cluster with Europe and 
the Near East (Supplementary Information section 9 and Extended Data 
Fig. 3). Principal component analysis (PCA)'* (Supplementary Informa- 
tion section 10) (Fig. 2) indicates a discontinuity between the Near East 
and Europe, with each showing north-south clines bridged only by a few 
populations of mainly Mediterranean origin. We projected’* the newly 
sequenced and previously published’ ancient genomes onto the first 
two principal components (PCs) (Fig. 2). Upper Palaeolithic hunter- 
gatherers’ from Siberia like the MA1 (Mal’ta) individual project at the 
northern end of the PCA, suggesting an ‘ancient north Eurasian’ (ANE) 
meta-population. European hunter-gatherers from Spain’, Luxembourg, 
and Sweden‘ fall beyond present-day Europeans in the direction of Euro- 
pean differentiation from the Near East, and form a ‘west European 
hunter-gatherer’ (WHG) cluster including Loschbour and La Brafia’, 
and a ‘Scandinavian hunter-gatherer’ (SHG) cluster including the Motala 
individuals and ~5,000-year-old hunter-gatherers from the Pitted Ware 
Culture*. An ‘early European farmer’ (EEF) cluster includes Stuttgart, 
the ~5,300-year-old Tyrolean Iceman’ and a ~5,000-year-old Swedish 
farmer’. 

Patterns observed in PCA may be affected by sample composition 
(Supplementary Information section 10) and their interpretation in terms 
of admixture events is not straightforward, so we rely on formal analysis 
of f statistics* to document mixture of at least three source populations 
in the ancestry of present Europeans. We began by computing all possi- 
ble statistics of the form f3( Test; Ref;, Ref.) (Supplementary Information 
section 11), which if significantly negative show unambiguously* that 
Test is admixed between populations anciently related to Ref, and Ref, 
(we choose Ref, and Ref, from 5 ancient and 192 present populations). 
The lowest f; statistics for Europeans are negative (93% are > 4 stan- 
dard errors below 0), with most showing strong support for at least one 
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Figure 1 | Map of west Eurasian populations. Geographical locations of 
analysed samples, with colour coding matching the PCA (Fig. 2). We show all 
sampling locations for each population, which results in multiple points for 
some (for example, Spain). 


ancient individual being one of the references (Supplementary Infor- 
mation section 11). Europeans almost always have their lowest f; with 
either (EEF, ANE) or (WHG, Near East) (Supplementary Information 
section 11, Table 1 and Extended Data Table 1), which would not be 
expected if there were just two ancient sources of ancestry (in which 
case the best references for all Europeans would be similar). The lowest 
fy statistic for Near Easterners always takes Stuttgart as one of the ref- 
erence populations, consistent with a Near Eastern origin for Stuttgart’s 
ancestors (Table 1). We also computed the statistic f,(Test, Stuttgart; 
MAI, Chimp), which measures whether MA1 shares more alleles with 
a Test population or with Stuttgart. This statistic is significantly posi- 
tive (Extended Data Fig. 4 and Extended Data Table 1) if Test is nearly 
any present-day West Eurasian population, showing that MA1-related 
ancestry has increased since the time of early farmers like Stuttgart (the 
same statistic using Native Americans instead of MA1 has the same 


sign but is smaller in magnitude (Extended Data Fig. 5), indicating that 
MAL isa better surrogate than the Native Americans who were first used 
to document ANE ancestry in Europe”*). The analogous statistic f,(Test, 
Stuttgart; Loschbour, Chimp) is nearly always positive in Europeans and 
negative in Near Easterners, indicating that Europeans have more ancestry 
from populations related to Loschbour than do Near Easterners (Extended 
Data Fig. 4 and Extended Data Table 1). Extended Data Table 2 docu- 
ments the robustness of key f, statistics by recomputing them using trans- 
version polymorphisms not affected by ancient DNA damage, and also 
using whole-genome sequencing data not affected by SNP ascertain- 
ment bias. Extended Data Fig. 6 shows the geographic gradients in the 
degree of allele sharing of present-day West Eurasians (as measured by 
fastatistics) with Stuttgart (EEF), Loschbour (WHG) and MAI (ANE). 

To determine the minimum number of source populations needed 
to explain the data for many European populations taken together, we 
studied the matrix of all possible statistics of the form fy(Testpase. Testy 
Opase Oj) (Supplementary Information section 12). Testyase is a reference 
European population, Test; is the set of all other European Test popula- 
tions, Opase is a reference outgroup, and O; is the set of other outgroups 
(ancient DNA samples, Onge, Karitiana, and Mbuti). The rank of the (i, /) 
matrix reflects the minimum number of sources that contributed to the 
Test populations'*’”. For a pool of individuals from 23 Test populations 
representing most present-day European groups, this analysis rejects 
descent from just two sources (P< 10” bya Hotelling t-test'”). How- 
ever, three source populations are consistent with the data after exclud- 
ing the Spanish who have evidence for African admixture'*” (P = 0.019, 
not significant after multiple-hypothesis correction), consistent with the 
results from ADMIXTURE (Supplementary Information section 9), PCA 
(Fig. 2 and Supplementary Information section 10) and f statistics (Ex- 
tended Data Table 1, Extended Data Fig. 6, Supplementary Information 
sections 11 and 12). We caution that the finding of three sources could be 
consistent with a larger number of mixture events. Moreover, the source 
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Table 1 | Lowest f3 statistics for each west Eurasian population 


LETTER 


Ref; Refs Target for which these two references give the lowest f3(X; Ref, Ref2) 

WHG EEF Sardinian*** 

WHG Near East Basque, Belarusian, Czech, English, Estonian, Finnish, French_South, Icelandic, Lithuanian, Mordovian, Norwegian, Orcadian, 
Scottish, Spanish, Spanish_North, Ukrainian 

WHG Siberian Russian 

EEF ANE Abkhasian***, Albanian, Ashkenazi_Jew****, Bergamo, Bulgarian, Chechen****, Croatian, Cypriot****, Druze**, French, Greek, 
Hungarian, Lezgin, Maltese, Sicilian, Turkish_Jew, Tuscan 

EEF Native American Adygei, Balkar, Iranian, Kumyk, North_Ossetian, Turkish 

EEF African BedouinA, BedouinB+, Jordanian, Lebanese, Libyan_Jew, Moroccan_Jew, Palestinian, Saudi****, Syrian, Tunisian_Jew***, 
Yemenite_Jew*** 

EEF South Asian Armenian, Georgian****, Georgian_Jew*, lranian_Jew***, lraqi_Jew*** 


WHG = Loschbour or LaBrafa; EEF = Stuttgart; ANE = MA1; Native American = Piapoco; African = Esan, Gambian, or Kgalagadi; South Asian = GujaratiC or Vishwabrahmin. Statistics are negative with Z< —4 
unless otherwise noted: +(positive) or *, **, ***, ****, to indicate Z less than 0, —1, —2, and —3, respectively. The complete list of statistics can be found in Extended Data Table 1. 


populations may themselves have been mixed. Indeed, the positive f, 
(Stuttgart, Test; Loschbour, Chimp) statistics obtained when Test is Near 
Eastern (Extended Data Table 1) imply that the EEF had some WHG- 
related ancestry, which was greater than 0% and as high as 45% (Sup- 
plementary Information section 13). 

We used the ADMIXTUREGRAPH software*”* to fit a model (a tree 
structure augmented by admixture events) to the data, exploring models 
relating the three ancient populations (Stuttgart, Loschbour, and MA1) 
to two eastern non-A fricans (Onge and Karitiana) and sub-Saharan Afri- 
cans (Mbuti). We found no models that fit the data with 0 or 1 admixture 
events, but did find a model that fit with 2 admixture events (Supplemen- 
tary Information section 14). The successful model (Fig. 3) confirms the 
existence of MA1-related admixture in Native Americans’, but includes 
the novel inference that Stuttgart is partially (44 + 10%) derived from 
a lineage that split before the separation of eastern non-Africans from 
the common ancestor of WHG and ANE. The existence of such basal 
Eurasian admixture into Stuttgart provides a simple explanation for our 
finding that diverse eastern non-African populations share significantly 
more alleles with ancient European and Upper Palaeolithic Siberian 


Basal Eurasian 


Eastern non-African 


Ancient north Eurasian 


West Eurasian 244 + 10% 


41 18% 


European 


Figure 3 | Modelling the relationship of European to non-European 
populations. A three-way mixture model that is a fit to the data for many 
populations. Present-day samples are coloured in blue, ancient in red, and 
reconstructed ancestral populations in green. Solid lines represent descent 
without mixture, and dashed lines represent admixture. We print mixture 
proportions and one standard error for the two mixtures relating the highly 
divergent ancestral populations. (We do not print the estimate for the 
‘European’ population as it varies depending on the population.) 


hunter-gatherers than with Stuttgart (that is, f,(Eastern non-African, 
Chimp; Hunter-gatherer, Stuttgart) is significantly positive), but that 
hunter-gatherers appear to be equally related to most eastern groups 
(Supplementary Information section 14). We verified the robustness of 
the model by reanalysing the data using the unsupervised MixMapper’ 
(Supplementary Information section 15) and TreeMix”' software (Sup- 
plementary Information section 16), which both identified the same 
admixture events. The ANE-WHG split must have occurred > 24,000 years 
ago (as it must predate the age of MA] (ref. 3)), and the WHG and Eastern 
non-African split must have occurred > 40,000 years ago (as it must 
predate the Tianyuan” individual from China which clusters with Asians 
to the exclusion of Europeans). The basal Eurasian split must be even 
older, and might be related to early settlement of the Levant” or Arabia”*”° 
before the diversification of most Eurasians, or more recent gene flow 
from Africa’®. However, the basal Eurasian population shares much of the 
genetic drift common to non-African populations after their separation 
from Africans, and thus does not appear to represent gene flow between 
sub-Saharan Africans and the ancestors of non-Africans after the out-of- 
Africa bottleneck (Supplementary Information section 14). 

Fitting present-day Europeans into the model, we find that few pop- 
ulations can be fit as two-way mixtures, but nearly all are compatible with 
three-way mixtures of ANE-EEF-WHG (Supplementary Information 
section 14). The mixture proportions from the fitted model (Fig. 4 and 
Extended Data Table 3) are encouragingly consistent with those obtained 
from a separate method that relates European populations to diverse out- 
groups using f, statistics, assuming only that MA1 is an unmixed descen- 
dent of ANE, Loschbour of WHG, and Stuttgart of EEF (Supplementary 
Information section 17). We infer that EEF ancestry in Europe today 
ranges from ~30% in the Baltic region to ~90% in the Mediterranean, 
consistent with patterns of identity-by-descent (IBD) sharing”””* (Sup- 
plementary Information section 18) and shared haplotype analysis (chro- 
mosome painting)”’ (Supplementary Information section 19) in which 
Loschbour shares more segments with northern Europeans and Stuttgart 
with southern Europeans. Southern Europeans inherited their European 
hunter-gatherer ancestry mostly via EEF ancestors (Extended Data Fig. 6), 
whereas northern Europeans acquired up to 50% of WHG ancestry above 
and beyond what they received through their EEF ancestors. Europeans 
have a larger proportion of WHG than ANE ancestry in general. By 
contrast, in the Near East there is no detectable WHG ancestry, but up 
to ~29% ANE in the Caucasus (Supplementary Information section 
14). A striking feature of these findings is that ANE ancestry is inferred 
to be present in nearly all Europeans today (with a maximum of ~20%), 
but was absent in both farmers and hunter-gatherers from central and 
western Europe during the Neolithic transition. However, ANE ances- 
try was not completely absent from the larger European region at that 
time: we find that it was present in ~8,000-years-old Scandinavian 
hunter-gatherers, as MA1 shares more alleles with Motalal2 (SHG) 
than with Loschbour, and Motala12 fits as a mixture of 81% WHG and 
19% ANE (Supplementary Information section 14). 

Two sets of European populations are poor fits for the model. Sicilians, 
Maltese, and Ashkenazi Jews have EEF estimates of > 100%, consistent 
with their having more Near Eastern ancestry than can be explained via 
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e Albanian = French = French_South ¢ Estonian 
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Figure 4 | Estimates of mixture proportions in present-day Europeans. Plot 
of the proportions of ancestry from each of three inferred ancestral populations 
(EEF, ANE and WHG). 


EEF admixture (Supplementary Information section 17). They also can- 
not be jointly fit with other Europeans (Supplementary Information 
section 14), and they fall in the gap between European and Near East- 
erners in PCA (Fig. 2). Finns, Mordovians and Russians (from the north- 
west of Russia) also do not fit (Supplementary Information section 14; 
Extended Data Table 3) due to East Eurasian gene flow into the ances- 
tors of these north-eastern European populations. These populations 
(and Chuvash and Saami) are more related to east Asians than can be 
explained by ANE admixture (Extended Data Fig. 7), probably reflect- 
ing a separate stream of Siberian gene flow into north-eastern Europe 
(Supplementary Information section 14). 

Several questions will be important to address in future ancient DNA 
work. One question concerns where and when the Near Eastern farmers 
mixed with European hunter-gatherers to produce the EEF. A second ques- 
tion concerns how the ancestors of present-day Europeans first acquired 
their ANE ancestry. Discontinuity in central Europe during the late Neo- 
lithic (~4,500 years ago) associated with the appearance of mtDNA types 
absent in earlier farmers and hunter-gatherers” raises the possibility 
that ANE ancestry may have also appeared at this time. Finally, it will 
be important to study ancient genome sequences from the Near East to 
provide insights into the history of the basal Eurasians. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Lethal aggression in Pan is better explained by 
adaptive strategies than human impacts 


Michael L. Wilson’, Christophe Boesch’, Barbara Fruth*®, Takeshi Furuichi®, Ian C. Gilby”®, Chie Hashimoto®, 
Catherine L. Hobaiter’, Gottfried Hohmann’, Noriko Itoh’®, Kathelijne Koops", Julia N. Lloyd”, Tetsuro Matsuzawa®!?, 
John C. Mitani’*, Deus C. Mjungu!®, David Morgan’®, Martin N. Muller!’”, Roger Mundry'®, Michio Nakamura”, Jill Pruetz!’, 


Anne E. Pusey’, Julia Riedel’, Crickette Sanz”°, Anne M. Schel”!, 


Nicole Simmons", Michel Waller??, David P. Watts?’, 


Frances White”, Roman M. Wittig?, Klaus Zuberbtihler?’*+ & Richard W. Wrangham”> 


Observations of chimpanzees (Pan troglodytes) and bonobos (Pan 
paniscus) provide valuable comparative data for understanding the 
significance of conspecific killing. Two kinds of hypothesis have 
been proposed. Lethal violence is sometimes concluded to be the 
result of adaptive strategies, such that killers ultimately gain fit- 
ness benefits by increasing their access to resources such as food or 
mates'°. Alternatively, it could be a non-adaptive result of human 
impacts, such as habitat change or food provisioning®°. To discri- 
minate between these hypotheses we compiled information from 
18 chimpanzee communities and 4 bonobo communities studied 
over five decades. Our data include 152 killings (n = 58 observed, 
41 inferred, and 53 suspected killings) by chimpanzees in 15 com- 
munities and one suspected killing by bonobos. We found that 
males were the most frequent attackers (92% of participants) and 
victims (73%); most killings (66%) involved intercommunity attacks; 
and attackers greatly outnumbered their victims (median 8:1 ratio). 
Variation in killing rates was unrelated to measures of human impacts. 
Our results are compatible with previously proposed adaptive expla- 
nations for killing by chimpanzees, whereas the human impact hypo- 
thesis is not supported. 

Substantial variation exists in rates of killing across chimpanzee study 
sites**"°"”, The human impact and adaptive strategies hypotheses both 
seek to explain this variation, but have contrasting predictions, which we 
test here (Tables 1 and 2). The human impact hypothesis states that 
killing is an incidental outcome of aggression, exacerbated by human 
activities such as deforestation, introducing diseases, hunting or pro- 
viding food. Accordingly, lethal aggression should be high where human 
disturbance is high’. 


Table 1 | Predicted patterns of lethal aggression 


In contrast, the adaptive strategies hypothesis views killing as an 
evolved tactic by which killers tend to increase their fitness through 
increased access to territory, food, mates or other benefits’>'°'”. Kin 
selection’® and evolutionary game theory” yield a set of specific predic- 
tions for how benefits and costs should vary with the context, age, sex, 
and genetic relatedness of the attackers and targets. Lethal aggression 
occurs within a diverse set of circumstances, but is expected to be most 
commonly committed by males; directed towards males; directed towards 
non-kin, particularly members of other groups; and committed when 
overwhelming numerical superiority reduces the costs of killing. Previ- 
ous studies have developed and tested these specific hypotheses**""""’; 
the present study represents the first effort to test multiple hypotheses 
simultaneously with a comprehensive data set. We assembled data from 
communities of eastern (n = 12) and western (n = 6) chimpanzees” 
studied over 426 years (median = 21 years; range: 4-53) and from 4 
bonobo communities studied for 92 years (median = 21; range: 9-39; 
Extended Data Fig. 1). We rated each case of killing as observed, in- 
ferred, or suspected (see Methods; Extended Data Tables 1-4). To be 
conservative, we limited our analyses to those rated ‘observed’ and 
‘inferred’ unless otherwise noted. We examined contrasting predictions 
relating to overall patterns of killings (Table 1) and variation among 
communities (Table 2). 

Bonobos are consistently found to be less violent than chimpanzees””’, 
and lower rates of killing are reported for western than eastern chim- 
panzees”"’. The human impact hypothesis could in theory ascribe these 
variations to different levels of disturbance. In contrast, in behavioural 
ecology, distinct populations are expected to respond to prevailing 
ecological circumstances through biological evolution and/or phenotypic 


Adaptive strategies hypothesis 


Variable Human impact hypothesis 
Chimpanzees kill more than bonobos None 

Rate of killing over time + 

Sex bias: attackers one 

Sex bias: victims None 

Age of victims one 

Genetic relatedness of attackers and victims None 

Numerical asymmetries None 


+ 

one 

Mainly males 

Mainly males 

Mainly young infants (most vulnerable and/or reduce time to mother’s next estrus) 
Mainly non-relatives (for example, members of other communities) 

Victims greatly outnumbered 
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Table 2 | Predicted correlates of number of killings per study com- 
munity 


Variable Human impact Adaptive strategies 
hypothesis hypothesis 

Provisioning (provisoned) + None 

Size of protected area, km? (area) 7 None 
Disturbance rating (disturbance) te None 

Eastern vs. western chimpanzees (clade) | None + 

Mean number of adult males (males) None ca 

Mean population density (density) None + 


flexibility. For bonobos and western chimpanzees, ecological factors 
apparently allow relatively high gregariousness, which reduces the risk 
of experiencing a lethal attack”"'. Our data set covers all major studies 
of both species of Pan, which include sites with and without a history of 
provisioning, and with high and low levels of human disturbance, a 
rating estimated independently by each site’s director(s) (Methods; 
Extended Data Figs la and 2a). 

We documented killings by chimpanzees in 15 of 18 communities 
(58 observed, 41 inferred, and 53 suspected cases; Extended Data 
Tables 1-4) (Fig. 1). For bonobos, we documented only a single (sus- 
pected) case, which occurred at Lomako, a never-provisioned site with 
a low disturbance rating. No killings were recorded at other bonobo 
sites, including one with a history of provisioning and a high disturbance 
rating (Wamba). Controlling for years of observation, chimpanzees 
had a higher rate of killing than bonobos; this difference was statist- 
ically significant for eastern but not western chimpanzees (Poisson 
regression: n = 22 communities; estimated coefficients + s.e. for chimpan- 
zees compared to bonobos: fp = —4.5 + 1.0; Beast = 3.4 + 1.0, z = 3.3, 
P= 0.0008; Bwyest = 0.65 + 1.2, z= 0.56, P = 0.57; overall effect of clade: 
7 = 80.8, df = 2, P< 0.0001). This difference persisted when ‘suspected’ 
cases were included (Extended Data Table 5a). 

To investigate which factors best explained variation in killing rates 
among chimpanzee communities, we used an information theoretic 
approach”, controlling for years of observation. We considered three 
variables for the human impact hypothesis: provisioned (whether the 
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Figure 1 | Number of victims killed per year by members of study 
communities. Bars indicate the annual rate of observed (black), inferred 
(grey), and suspected (white) killings by each community for bonobos 

(B; n = 4), eastern chimpanzees (E; n = 12), and western chimpanzees 

(W; n = 6). Communities with a history of provisioning are indicated by (P). 
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community had been artificially fed); area (size of protected area, with 
smaller areas assumed to experience more impacts); and disturbance. 
We also considered three variables for the adaptive strategies hypo- 
thesis: clade (eastern and western chimpanzees may have different 
histories of selection for violence); males (number of adult males, 
which may influence rates of killing via intensity of reproductive com- 
petition and/or coalitional fighting power), and density (number of 
individuals per km’, which may affect frequency of intercommu- 
nity encounter and/or intensity of resource competition). We consider 
density to reflect natural food abundance. For example, at Ngogo (4.5 
chimpanzees per km”), vegetation sampling revealed high forest pro- 
ductivity” and chimpanzees have high C-peptide levels”, indicating 
high energy balance; whereas at Fongoli (0.37 chimpanzees per km/), 
chimpanzees range widely across a dry savannah with sparse food”. 
Density was unrelated to disturbance (general linear model, F, 16 = 1.4, 
P=0.26). 

Of the 16 models we considered (Table 3), four of the five models in 
the resulting 95% confidence set included combinations of the adaptive 
variables; the fifth model included the three human impact variables. 
The best model included only males and density, and was supported 
6.8 times more strongly than the human impact model (evidence 
ratio = w,/w; = 0.40/0.059 = 6.8). Considering model-averaged para- 
meter estimates”’, increases in males and density increased the number 
of killings; for all other parameter estimates, the 95% confidence inter- 
vals included zero (Table 3 and Fig. 2). Excluding one community 
(Ngogo) that had both an unusually high killing rate and unusually 
many males resulted in similar values for model-averaged parameters, 
but only the estimate for density excluded zero from the 95% con- 
fidence interval (Extended Data Table 5b; n = 17). 

Opposite to predictions from the human impact hypothesis (Table 2), 
provisioned and disturbance both had negative effects; the estimates 
for these parameters included zero in the 95% confidence intervals 
(Table 3 and Extended Data Fig. 2b). The highest rate of killing occurred 
at a relatively undisturbed and never-provisioned site (Ngogo); chim- 
panzees at the least disturbed site (Goualougo) were suspected of one 
killing and inferred to have suffered an intercommunity killing; and 
no killings occurred at the site most intensely modified by humans 
(Bossou). 

As a test of confidence, we investigated the effects of including sus- 
pected cases and data from bonobos. Including suspected cases changed 
western and provisioned from negative to positive (Extended Data 
Table 5b). Nonetheless, even with these suspected cases, none of the 
estimates for human impact variables excludes zero from the 95% 
confidence interval. Including bonobo data widened the confidence 
intervals for density (Extended Data Table 5b), probably because two 
bonobo communities had high densities (Extended Data Fig. 1a). With 
either suspected cases or bonobo data added, only for males did the 
95% confidence intervals exclude zero (Extended Data Table 5b). Thus, 
although demographic variables explain variation in rates of killing 
better than human impact variables, the confidence intervals are sensitive 
to including suspected cases or data from another species (bonobos). 

These analyses combine killings committed for varied reasons by 
individuals in different age-sex classes. A full explanation of these events 
requires a finer grained analysis. To this end, we examined variation over 
time and among different categories of attacker and victim. 

Increasing human impacts have been proposed to cause increasing 
numbers of killings in recent years*. However, controlling for changes 
in the number of communities observed per year (communities), the 
rate of killing has not changed over time (year). Using an information 
theoretic approach” to compare three different models (year; com- 
munities; and year plus communities), the best model contained only 
communities; considering model-averaged parameters, the 95% con- 
fidence interval excluded zero for communities, but not year (Poisson 
regression: n = 52 years; model-averaged parameters and 95% con- 
fidence interval: By = 10 (—38 to 58); Byear = —0.0058 (—0.022 to 
0.010); Bcommunities = 0.18 (0.10-0.26); Extended Data Table 5c). 
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Table 3 | Summary of model selection: number of killings per community 


B Clade Males Density Area Prov Dist K A Wi 
=3.6 0.081 0.21 4 0.00 0.40 
=23 =19 0.073 4 0.61 0.30 
=3.1 -14 0.073 0.15 5 1.8 0.16 
2.7 0.087 3 3.4 0.07 
il —0.0016 -14 =0.63 5 3.8 0.06 
—2.2 2.4 0.10 0.42 —0.00083 1.3 —O0:27 8 10 0.00 
3.7 —0.0011 -—0.40 4 12 0.00 
=20 =2.1 0.17 4 17 0.00 
-1.2 =—27 3 18 0.00 
=28 0.28 3 21 0.00 
=1.1 —0.00042 3 24 0.00 
=1.1 —0.00042 —0.12 4 28 0.00 
=15 2 34 0.00 
-16 0.19 3 36 0.00 
-1.4 —0.011 3 37 0.00 
=-16 0.18 —0.0046 4 40 0.00 

MAP —24 —0.78 0.073 O14 —0.00010 =0.078 —0.038 

2.5% = 5.0 =—L8 0.053 0.00029 —0.00027 -0.24 =O. 

97.5% 0.12 0.25 0.093 0.22 0.000083 0.082 0.033 


Parameters include the intercept (B); impact of western relative to the eastern clade of chimpanzees; mean number of adult males per community (males); mean population density per community (density); size 
of protected area in km? (area); history of regular provisioning with food (prov); disturbance rating (dist.); the number of free parameters (k) including the dispersion parameter (é = 2.8); the difference in Akaike 
information criterion (corrected for overdispersion: QAICc) between the ith model and the best model (4)); and model weight (w;). Models are arranged in order from best (lowest AQAICc;) to worst (highest AQAICc). 
The weight of the model (w;) is the probability that a given model is the best model in a given set of models. Model-averaged parameter estimates (MAP) with upper (97.5%) and lower (2.5%) bounds of the 95% 


confidence intervals are given in the bottom rows. 


Killings involved a median of five male attackers (range: 0-19) and 
no females (range: 0-6). Considering all cases for which the number of 
attackers was observed (n = 58) or could be inferred (n = 6), males 
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Figure 2 | Number of killings per year for each community versus number 
of males and population density. Rates for each community are indicated 
by black diamonds (chimpanzees; n = 18) and open squares (bonobos; 

n= 4). Black lines indicate simple linear regression for chimpanzee data for 
illustrative purposes only; statistical tests were done using Poisson regressions. 
a, Number of killings versus number of males. b, Number of killings versus 
population density (individuals per km”). 
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constituted 92% of participants in attacks (338/366). Controlling for 
observation time and community composition, males were much more 
likely to participate in killings than females (negative binomial mixed 
model: n = 36 observations (fixed effects: sex with 2 levels; random 
effects: community with 18 levels); By = —6.9 + 0.98; Bmates = 2.6 + 0.59, 
z= 4.42, P<0.0001). Females sometimes joined males in attacking 
grown individuals (n = 3), but when acting without males, females 
killed only young infants (n = 8). 

Controlling for observation time and community composition, males 
and infants had the highest probability of being killed (Extended Data 
Table 6). Notably, during infanticides, attackers sometimes removed 
infants from mothers under circumstances in which they appeared cap- 
able of killing the mother as well, but did not do so. 

Most victims were members of different communities from the 
attackers (n = 62 of 99 cases; 63%) and thus not likely to be close kin”. 
This difference is particularly striking given that chimpanzees could 
potentially attack members of their own community on a daily basis, 
but rarely encounter members of other communities (for example, 
1.9% of follow days at Kanyawara’’). 

Intercommunity killings mainly involved parties with many males 
(median = 9 males, range: 2-28, n = 36 cases with known numbers of 
attackers) attacking isolated or greatly outnumbered males or, more 
often, mothers with infants (median = 0 males, range: 0-3, n = 30; 
median = 1 female, range: 0-5, n = 31). For 30 cases in which the num- 
ber of adult and adolescent males and females on each side were known, 
attackers outnumbered defenders by a median factor of 8 (range: 1-32; 
Extended Data Table 7). Most intercommunity killings thus occurred 
when attackers overwhelmingly outnumbered victims. 

Several robust patterns emerge from these data. Killing was most 
common in eastern chimpanzees and least common among bonobos. 
Among chimpanzees, killings increased with more males and higher 
population density, whereas none of the three human impact variables 
had an obvious effect. Male chimpanzees killed more often than females, 
and killed mainly male victims; attackers most frequently killed unweaned 
infants; victims were mainly members of other communities (and thus 
unlikely to be close kin); and intercommunity killings typically occurred 
when attackers had an overwhelming numerical advantage. The most 
important predictors of violence were thus variables related to adaptive 
strategies: species; age-sex class of attackers and victims; community 
membership; numerical asymmetries; and demography. We conclude that 
patterns of lethal aggression in Pan show little correlation with human 
impacts, but are instead better explained by the adaptive hypothesis that 
killing is a means to eliminate rivals when the costs of killing are low. 
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Optimization of lag time underlies antibiotic 
tolerance in evolved bacterial populations 


Ofer Fridman’, Amir Goldberg", Irine Ronin', Noam Shoresh? & Nathalie Q. Balaban! 


The great therapeutic achievements of antibiotics have been dramat- 
ically undercut by the evolution of bacterial strategies that overcome 
antibiotic stress'*. These strategies fall into two classes. ‘Resistance’ 
makes it possible for a microorganism to grow in the constant pres- 
ence of the antibiotic, provided that the concentration of the anti- 
biotic is not too high. ‘Tolerance’ allows a microorganism to survive 
antibiotic treatment, even at high antibiotic concentrations, as long 
as the duration of the treatment is limited. Although both resistance 
and tolerance are important reasons for the failure of antibiotic 
treatments*°, the evolution of resistance’? is much better understood 
than that of tolerance. Here we followed the evolution of bacterial 
populations under intermittent exposure to the high concentrations 
of antibiotics used in the clinic and characterized the evolved strains 
in terms of both resistance and tolerance. We found that all strains 
adapted by specific genetic mutations, which became fixed in the 
evolved populations. By monitoring the phenotypic changes at the 
population and single-cell levels, we found that the first adaptive 
change to antibiotic stress was the development of tolerance through 
a major adjustment in the single-cell lag-time distribution, without 
a change in resistance. Strikingly, we found that the lag time of bac- 
teria before regrowth was optimized to match the duration of the 
antibiotic-exposure interval. Whole genome sequencing of the evolved 
strains and restoration of the wild-type alleles allowed us to identify 
target genes involved in this antibiotic-driven phenotype: ‘tolerance 
by lag’ (tbl). Better understanding of lag-time evolution as a key deter- 
minant of the survival of bacterial populations under high antibiotic 
concentrations could lead to new approaches to impeding the evolu- 
tion of antibiotic resistance. 

We exposed batch cultures of the bacterium Escherichia coli to a high 
concentration of ampicillin (100 1g ml‘), which is 15 times greater 
than the minimum inhibitory concentration (MIC), ina cyclic manner. 
The frequency of mutants that were resistant to this concentration’? 
was less than 10° ''. In each cycle (Fig. 1a), parallel overnight cultures 
in small volumes were resuspended in fresh medium containing ampi- 
cillin. After a fixed duration, T’,, of daily exposure to the antibiotic, the 
cultures were washed to remove the drug, resuspended in fresh med- 
ium and grown overnight. We evolved six populations, two for each of 
the three exposure durations: T, = 3, 5 and 8 h. In all cases, the bacteria 
soon adapted to the stressful regimen, and follow-up experiments estab- 
lished the genetic basis of this adaptation. 

In the first cycle, the proportion of surviving bacteria at each T, was 
less than 0.1%. After eight to ten cycles, ampicillin became at least an 
order of magnitude less effective at killing the bacteria (Fig. 1b and Ex- 
tended Data Fig. 1). The increased survival rate of the evolved bacteria 
could have been achieved by a mutation that conferred resistance. We 
found, however, that the MIC of ampicillin for clones isolated from the 
evolved lines (tbl3a, tbl5a and tbl8a, evolved at T, = 3,5 and 8 h, respec- 
tively) was indistinguishable from that for their ancestors (Fig. 1c). Sim- 
ilar to the MIC, the measure MDKg, (the minimum duration for killing 
99% of cells) may be defined to quantify tolerance (Box 1). A higher tol- 
erance translates to a longer MDKop: that is, a treatment of longer duration 


is needed to reach the same level of killing. We measured the MDKgg 
for the wild-type and evolved strains and found that the MDKog of the 
evolved population increased with the duration of the stress period, 
reaching values as high as 15 times the MDKoy of the wild-type strain 
(Fig. 1d). We conclude that these populations have all adapted to the 
antibiotic regimen through tolerance and not resistance. 

Different mechanisms can enable bacteria to endure an interval of 
exposure to antibiotics. One way to achieve tolerance is by slowed gro 
However, no reduction in growth that could account for the observed 
tolerance was measured (Extended Data Fig. 2a). Another strategy for 
tolerance is related to the cells being in stationary phase before expo- 
sure to antibiotics, resulting in a delay in regrowth when switched toa 
new environment. By extending the time to first division (the single-cell 
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Figure 1 | Resistance and tolerance of the evolved strains. a, Experimental 
design for cyclic exposure to antibiotics. In each cycle, a small-volume 
overnight culture was resuspended in a larger volume of fresh medium 
containing 100 ug ml‘ ampicillin for an exposure time T,. After the antibiotic 
was washed out, the culture was resuspended in fresh medium and grown 
overnight. b, Survival of the evolved strains and ancestral strain after 

100 pg ml + ampicillin treatment for T, = 3, 5 or 8h: strains tbi3a, tbl5a and 
tbl8a, respectively (see Extended Data Table 1). Data are presented as the 
mean + s.d. of two independent experiments. c, MIC test carried out using disc 
diffusion antibiotic sensitivity testing. d, Increase in tolerance of the evolved 
strains. MDKogg (see Box 1) was determined by measuring the time to kill 99% of 
the population. Data are presented as the mean + s.d. from four experiments. 
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BOX | 
Schematics of resistance and 
tolerance to bactericidal antibiotics 


An antibiotic treatment is characterized by the concentration at which 
itis administered and its duration. The outcome of a treatment can be 
quantified by the fraction of bacteria killed by the drug. The curves 
(Box 1 Figure, panel a) indicate lines of equal killing for the wild-type 
(WT) strain. The cyan curve represents the combinations of treatment 
durations and concentrations that result in 99% killing. The curve 
displays two asymptotic behaviours: the vertical asymptote shows the 
concentration below which the culture will not be killed, even for a very 
long antibiotic treatment, namely the MIC. The horizontal asymptote 
shows the characteristic time needed to kill 99% of the culture, in 
the limit of high antibiotic concentrations. We term this asymptotic 
value the minimum duration for killing, or MDKg9. Schematics of the 
99% killing curves for WT (cyan), resistant (R, orange) and tolerant 
(T, blue) strains are shown in Box 1 Figure, panel b. An increase in 
resistance translates into a shift in the MIC to higher concentrations, 
whereas an increase in tolerance manifests as a shift in the MDKg9 
to longer treatment durations. 
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lag time’”) and remaining longer in a dormant state, a cell may avoid 
the harms of antibiotics. A case in point is the tolerance of type I per- 
sistent bacteria’, which is based on the existence of a subpopulation of 
cells with sufficiently long single-cell lag times'’. To explore the pos- 
sibility of the evolution ofa lag-time-related strategy, we directly moni- 
tored, under the microscope, the time to first division of single cells taken 
from an overnight culture in stationary phase (Fig. 2a). Whereas cells 
from the ancestral clone divided within half an hour, the lag times of 
the evolved cells were distributed over many hours. 

To corroborate that extended lag time is the main factor reducing 
the killing rate by ampicillin in our protocol, we measured the survival 
of evolved clones under exposure to norfloxacin, a drug that belongs to 
a different class of bactericidal antibiotic. The expectation was that the 
benefits of the slow exit from the dormant phase are generic and would 
carry over to a different drug that targets growing bacteria’*”’. Indeed, 
a comparable increase in tolerance to norfloxacin was observed for the 
evolved clones, ruling out the possibility that a specific ampicillin-associated 
mechanism was responsible for the increase in tolerance (Extended Data 
Fig. 2b). Furthermore, the protective effect of the lag time was negated 
when the evolved clones were maintained in the exponential growth 
phase, a condition under which neither stationary phase nor the sub- 
sequent lag phase are reached (Fig. 2b). These results establish that in 
our cyclic protocol, the adaptive trait conferring tolerance to antibiotic 
treatments is the extended single-cell lag time. We term this adaptation 
‘tolerance by lag’(tbl), to distinguish it from tolerance that may be due 
to other factors. 

To quantify the extension of single-cell lag times at the population 
level, we measured the distribution of these times with our recently devel- 
oped ScanLag set-up’. Measuring the lag times of hundreds of cells in 
each case, we used this method to obtain single-cell lag-time distributions 
for the ancestral population and for clones isolated from the end popu- 
lations evolved under the three cyclic regimens (Fig. 2c, Supplementary 
Video 1 and Extended Data Fig. 3). For each of the empirical distributions, 
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Figure 2 | Changes in the single-cell lag-time distributions underlie 
antibiotic tolerance. a, Phase-contrast images of time-lapse microscopy of 
single bacteria from a stationary culture plated on fresh medium show the 
extended lag of the evolved strain. Times are indicated in hours and minutes. 
Scale bar, 5 um. b, Survival advantage of the evolved strain (tbl5a) over the 
ancestral strain (shown as fold change) when exposed to ampicillin, after 
stationary phase or after exponential growth. The dashed line indicates 
ancestral survival. Data are presented as the mean ~ s.d. of three independent 
experiments. c, Measurement of the lag-time distribution with ScanLag. The 
appearance time of colonies was continuously monitored by an automated 
scanner system'®. The histograms show the proportion of colony-forming units 
(CFUs) detected at each time point (mean = 0.78 h (ancestral), 2.1 h (tbi3a), 
4.9h (tbl5a), 10.3 h (tbl8a); median = 0.7 h (ancestral), 1.7 h (tbl3a), 3.7h 
(tbl5a), 9.2 h (tbl8a), with s.e.m. below 8% and sample sizes of n = 514, 747, 423 
and 168, respectively). Colonies appearing at later times grow at the same rate 
(Extended Data Fig. 2a). Stationary cultures were grown from a single colony 
isolated from the majority clone of the end populations. 


we calculated the mean single-cell lag time, 7)... We found that t),, increased 
from 1.0 + 0.2h for the ancestral strain to several hours for the evolved 
strains. Specifically, tag was found to be 3.4+ 1.0h, 5.1 + 0.2h and 
10 + 1h for the clones that evolved in response to cyclic 3, 5 and 8h 
exposures to antibiotic, respectively. These data show that the bacterial 
cultures under cyclic exposure to antibiotics adapted by extending the 
typical timescale of exit from the lag phase to approximately match the 
timescale of the antibiotic exposure. 

While delaying the exit from lag phase provides protection from the 
antibiotic, a delay that is too long comes at the expense of time that 
could be spent proliferating once the antibiotics have been washed out. 
Weexplored this trade-off computationally, using a simple population- 
dynamics model that describes the growth and death of cells under cyclic 
antibiotic exposure (Fig. 3a—c and see Methods). By approximating the 
empirical single-cell lag-time distribution as exponential, the fitness of 
a population is a function of two parameters: the mean lag time, tag, 
and the antibiotic-exposure time, T,. We used the model to calculate 
the value of t),, that maximizes the fitness for a fixed T,, and we found 
that the optimal lag time increases linearly with T,, which is in good 
agreement with the empirical results (Fig. 3d). 

We examined the genetic basis of the tb! phenotype by sequencing clones 
derived from the evolved populations (see Supplementary Methods)”. 
In each clone, few mutations were found, with a total of eight mutations 
across all sequenced clones being confirmed by Sanger sequencing. All 
mutations were in coding regions and were non-synonymous, suggest- 
ing that the affected proteins have a functional role in the tbl pheno- 
type. To explore this possibility, we restored the wild-type alleles in the 
evolved strains and identified which genes restored the wild-type lag 
time; we defined these genes as tbl genes (Extended Data Fig. 4). Of the 
six genes affected by the eight mutations, three were found to be tbl genes 
by this definition (Extended Data Tables 1 and 2). Notably, we observed 
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Figure 3 | Optimization of lag time. a, Schematic illustration of the processes 
in the theoretical model for the optimization of lag time. The fate of bacteria is 
determined by their growth state and by the environment. The non-growing 
phase (lag) protects the bacteria from the antibiotics. b, c, Non-growing 
bacteria switch to the growing state with a typical timescale of t,,. The trade-off 
of extending the lag time is illustrated by two strains: one with a short ty. (grey), 
and the other with a long ty,, (red). In the presence of antibiotics, the longer Tya. 


different mutations within the same gene in different clones, indicating 
that these genes are under strong selection. All tb] genes were found to 
have fixed mutations in the evolved populations (Extended Data Table 3). 

The tbl gene candidates are associated with several pathways. Two of 
these pathways, toxin-antitoxin modules’*”” and aminoacyl-transfer 
RNA synthetase” ~’, have been implicated in increased persistence. Inter- 
estingly, quantitative analysis has shown that toxin—antitoxin modules 
are a network motif that is ideally suited to set the timescale of the single- 
cell lag-time distribution’’. In contrast to these pathways, no relation- 
ship between the third emerging pathway, prs, and either lag or tolerance 
has been described in the literature. More experiments are needed to 
understand the relationship between the essential prs gene and the tbl 
phenotype. 

The evolution of the lag-time distribution is a remarkable demon- 
stration of an adaptation not to the specific nature of an environmental 
stress but to its duration. Not only does this finding illustrate that tem- 
poral parameters associated with growth can be readily changed during 
evolution”, but also the correspondence between stress duration and 
the timescale of the evolved phenotypes shows how fine-tuned this adap- 
tation can be. It is also notable that the malleable trait here is the lag 
time. Lag is commonly viewed as an inevitable delay in adjustment to 
new conditions imposed by metabolic constraints. Our study shows 
this phenotype in a different light: extended lag time as an advantageous 
trait for avoiding growth in adverse conditions and a trait that can evolve 
to accommodate selective pressures. 

In fact, it is the entire distribution of single-cell lag times that sets the 
fitness of the population, and it is the distribution as a whole that is being 
shaped during evolution. In a short period, the populations in this study 
adapted to antibiotic pressure by extending the single-cell lag-time dis- 
tribution (Fig. 2c). While the shift in the mean lag time is beneficial, this 
is not the case for the corresponding increase in variance. Indeed, for 
our regimen, maximum fitness is reached if all cells exit lag together, at 
the end of the antibiotic-exposure period. The observed increase in both 
the mean and the variance of the lag-time distribution is interesting and 
may indicate that those parameters are constrained at the molecular 
level to evolve concomitantly. Such an increase in variance could itself 
reflect past selection for a bet-hedging strategy**”°, as variance in the 
single-cell lag times does make sense when the duration of the antibiotic- 
exposure interval is not completely predictable. More generally, the time- 
scales and stochasticity of the environmental sequence of conditions” 
may shape the single-cell lag-time distribution beyond changes in the 
mean and variance”®, as in the case of type I persistence to antibiotic 
treatments, in which onlya fraction of the population is tolerant through 
having a long lag time’’. It should be noted that in our evolved strains, 
the whole population becomes tolerant (Extended Data Fig. 1). 


420 | NATURE | VOL 513 | 18 SEPTEMBER 2014 


Tlag = Tag 


—— Model 


—-—- Ancestral 
tbi3a 
tbi5a 
tbi8a 


Time 


reduces the killing and increases tolerance (b). In the absence of antibiotics, 
the longer tig delays the resumption of growth and bears a fitness cost 

(c). d, Lag time as a function of duration of antibiotic treatment, T,. The 
empirical mean lag-time dependence (circles) of the evolved thi clones follows 
the model predictions for lag-time optimization (solid line). Data are presented 
as the mean = s.d. of three independent experiments. The dashed line is the 
mean lag time of the ancestral strain. 


From a clinical point of view, tolerance by lag presents a major challenge”: 
in contrast to the specificity of resistance, the tbl phenotype confers a 
survival advantage against a broad spectrum of drugs and stresses. The 
revised understanding of antimicrobial tolerance offered here suggests 
new ways to eliminate bacterial populations, such as interfering with 
the precursor signal (stationary phase) from which bacteria anticipate 
stress. In addition, it is possible that tolerance facilitates the subsequent 
evolution of antibiotic resistance and that reducing tolerance might impede 
the emergence of antibiotic resistance. Understanding tolerance, resis- 
tance and the interplay between them in the adaptation of microorgan- 
isms to drugs is a critical goal in addressing the decreased efficacy of 
antibiotics”. 


METHODS SUMMARY 

Evolutionary protocol of cyclic exposure to antibiotics. Evolution under cyc- 
lic antibiotic exposure consisted of three steps. First, an overnight culture (0.5 ml; 
1 X 10° bacteria) was resuspended in 50 ml LB broth Lennox (LBL) supplemented 
with 100 pg ml” * ampicillin (Sigma) and incubated at 37 °C with shaking at 300 r.p.m. 
for T, hours (T, = 3, 5 or 8h). Second, the antibiotic-containing medium was removed 
by washing twice in LBL (10 min centrifugation at 1,400g). Last, the culture was 
resuspended in 1 ml fresh LBL and grown overnight (for approximately 20, 18 and 
15h, for the T, = 3, 5 and 8 h cultures, respectively) at 37 °C with shaking. It should 
be noted that nearly all of the surviving cells were kept from cycle to cycle, thus min- 
imizing drift; this is in contrast to typical serial dilution evolution experiments, in 
which most of the culture is discarded”. 

Single-cell lag-time distribution measurement with ScanLag. Overnight cul- 
tures grown from a single colony were plated at appropriate dilutions on solid LBL 
agar medium supplemented with 12.5 .g ml’ chloramphenicol (Sigma). A custom, 
automated scanner array set-up was used to monitor the appearance of thousands 
of colonies on plates over several days and to extract lag-time distributions by auto- 
mated image analysis’®. Single-cell microscopic observations confirmed that the delay 
in colony appearance was due to an extended lag. 

Theoretical model. The model describes a bacterial population consisting of two 
phenotypes, lagging (L) and growing (G) bacteria, whose dynamics are governed by 
first-order, linear differential equations: G = « (G/ Tgrow) + (L/Tiag) and L = —(L/ Tag)» 
where 1/Thag is the rate at which lagging bacteria switch to growth and 1/Tgrow is the 
growth rate. The outcome depends on the environment and is captured by the 
parameter a: in the absence of antibiotics, division leads to population growth and 
a = 1; however, when antibiotics are present, the cells that try to divide die, and 
a=-l1. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Genome sequencing of normal cells reveals 
developmental lineages and mutational processes 
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The somatic mutations present in the genome of a cell accumulate 
over the lifetime of a multicellular organism. These mutations can 
provide insights into the developmental lineage tree’, the number 
of divisions that each cell has undergone and the mutational pro- 
cesses that have been operative”. Here we describe whole genomes 
of clonal lines’ derived from multiple tissues of healthy mice. Using 
somatic base substitutions, we reconstructed the early cell divisions 
of each animal, demonstrating the contributions of embryonic cells 
to adult tissues. Differences were observed between tissues in the num- 
bers and types of mutations accumulated by each cell, which likely 
reflect differences in the number of cell divisions they have under- 
gone and varying contributions of different mutational processes. 
If somatic mutation rates are similar to those in mice, the results 
indicate that precise insights into development and mutagenesis of 
normal human cells will be possible. 

Somatic mutations in normal cells from multicellular organisms can 
provide insights into the origins and past experiences of each cell*. The 
developmental lineages of individual cells may be reconstructed using 
insertion or deletion mutations at short tandem repeats, as has been 
explored previously’*"' (Supplementary Discussion). Complete recon- 
struction of the bifurcating cell division tree requires at least one somatic 
mutation in the two daughter cells arising from each cell division. If 
mutations are generated during mitotic DNA replication, the number 
of divisions that a cell has undergone may be reflected in the number of 
mutations present”. Finally, the processes of DNA damage and/or repair 
experienced by different cell types may be manifested in distinct muta- 
tional signatures within the catalogues of mutation in each cell’. 

Single-cell whole-genome DNA sequencing has the potential to elu- 
cidate these aspects of the biology of normal cells. However, technologies 
are still under development” and are often characterized by incomplete 
genome coverage, suboptimal mutation sensitivity and substantial error 
rates for most mutation types. Errors mislead lineage reconstruction, 
distort mutational signatures and cause mis-estimation of numbers of 
mitoses undergone. 

We therefore derived clonal lines from normal mouse cells by using 
organoid technology*’*”’ as an alternative to single-cell analysis. Twenty- 
five organoid lines were obtained from the stomach, small bowel and 
large bowel of two mice (mouse 1 aged 116 weeks, and mouse 2 aged 
98 weeks) and from the prostate of mouse 2 (Supplementary Table 1). 
The whole genome was sequenced of each line and the polyclonal tail 
of each mouse, and extensive validation was performed to obtain high- 
quality catalogues of somatic mutations. After alignment to the refer- 
ence mouse genome, sets of somatic base substitution mutations were 
obtained by comparing each single-cell clone with all other clones from 
the same mouse, and with the tail (see Methods). Analyses of read 
count frequencies of the somatic substitutions indicated that they were 


heterozygous and that very few subclonal mutations, which may have 
arisen in vitro, were captured (Extended Data Fig. 1, Supplementary 
Discussion). 

To reconstruct the early developmental lineage tree of each mouse 
we searched for mutations present in at least two organoids and absent 
from at least one, applying computational and manual approaches 
(Methods and Extended Data Figs 2 and 3). All such putative early- 
embryonic mutations were further assessed by re-sequencing in every 
organoid, yielding 35 from the two mice (Supplementary Table 2). The 
phylogenetic lineage was then developed into a hypothetical tree of early 
embryonic cell divisions (Methods and Extended Data Fig. 3). Of the 
23 cell divisions requiring reconstruction to generate a simple bifur- 
cating tree for all the individual cell clones from both mice, 17 were 
reconstructable and 6 were not (Fig. 1 and Extended Data Fig. 4). Thus 
the intrinsic substitution mutation rate per cell division in early mouse 
embryos was almost sufficient to reconstruct the tree. Mutations defin- 
ing the first four cell generations were found in the tails of the two mice, 
confirming that they occurred before formation of the three germ layers 
during gastrulation because they were present in endoderm (the pre- 
cursor of tissues from which the clonal organoids were derived) and 
either mesoderm or ectoderm, or both (which contribute to the tail). 

The earliest reconstructed cell division in each tree may represent 
the first division of the fertilized egg. However, it is possible that earlier 
cell divisions took place, generating daughter cells that were not ances- 
tors of any of the 25 single cells sampled in the two mice. To assess this 
possibility we examined the sequencing read counts of mutations of 
the putative first two daughter cells (generation I, cells ‘b’ and ‘c’) in the 
polyclonal tail samples of each mouse. Although we cannot exclude the 
possibility that a small proportion was from earlier progenitors, the results 
are compatible with all cells in the tail samples being derivatives of the 
two earliest reconstructed cells in each mouse, and therefore that this 
first reconstructed cell division represented division of the fertilized egg 
(Fig. 2). It is also possible that daughter cells from earlier cell divisions 
did not contribute at all to the adult mice. If so, we would be unable to 
detect their existence. 

Several features of normal mouse development can be extracted from 
these reconstructions. The two daughter cells of early embryonic cell divi- 
sions often contribute unequally to adult tissues. For example, one of 
the daughter cells in the first generation of reconstructed cell divisions 
in mouse 1 (cell ‘b’ in Fig. 1a) is the progenitor for 12 organoid clones, 
whereas the other is the progenitor of just one (cell ‘c’). Examination of 
the tail from this animal, using the sequencing read counts of the muta- 
tions found in the first two daughter cells, again showed unequal con- 
tributions of the two progenitors: ~75% from one and ~30% from the 
other (cells ‘b’ and ‘c’ in Fig. 2a). A similar degree of inequality of con- 
tribution is shown for the first reconstructed cell division of mouse 2 
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Figure 1 | Reconstructed phylogenetic trees of cells from early mouse 
embryos. a, Mouse 1. b, Mouse 2. Each white-filled large circle represents 
an embryonic cell that is defined by a unique combination of mutations. 
Each mutation is represented by a number inside the white circles. Yellow 
highlighted numbers: mutations acquired during most recent mitosis. Letters 
next to white circles: identifiers of each embryonic cell. Roman numerals 


(cells ‘b’ and ‘c’ in Fig. 2b). These observations are consistent with results 
obtained by other approaches’*”’. This asymmetry propagates beyond 
the first cell division in both mice (Fig. 2). 

At these early stages of embryo development, individual cells contribute 
to multiple tissues. For example, at least three of the four reconstructed 


a Mouse 1 


0@@ @) ©@ )@@ @ ) 


Corpus = Pylorus Proximal Proximal Distal 
Stomach Small bowel Large bowel 
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indicate each reconstructed cell generation. c, d, Colour-filled smaller circles 
represent individual organoids derived from different anatomical regions of 
mouse 1 (c) and mouse 2 (d). Dashed lines connect each organoid with its last 
identifiable embryological precursor. An unknown number of cell generations 
lies between each organoid and its last identifiable embryological precursor. 


cells from the two mice at generation I are ancestors of all tissues sam- 
pled, including the tail, and are thus probably contribute to endoderm, 
mesoderm and ectoderm (mouse 1, cell ‘b’; mouse 2, cells ‘b’ and ‘c’; 
Figs la, b and 2a, b). A similarly broad tissue contribution is made by 
at least one reconstructed cell that has undergone two cell divisions 
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Figure 2 | Contributions of early embryonic cells to adult tail cell 
populations. Early embryonic phylogenetic trees of cells from mouse 1 (a) and 
mouse 2 (b), as in Fig. 1. The proportional contribution of each embryonic 
precursor cell to the population of cells in the tail is represented by the 
proportion of the circle area coloured red. This contribution was determined by 


assessment of the read count, in the tail, of the most recently acquired 
mutation(s) in each early embryo cell. Grey embryonic precursor cells did not 
acquire new mutations in the most recent mitosis, so their contributions to the 
tail cannot be measured directly. 
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(mouse 2, cell ‘d’; Fig. 1b). Cell ‘’, which has undergone three cell divi- 
sions in mouse 2, is the progenitor of prostate and large bowel cells and 
cells in tail tissues. The most distant early embryonic cells from the fer- 
tilized egg that we can reconstruct have undergone five cell divisions 
(mouse 1, cells “p’ and “q’; Fig. 1a), and one of these contributes to both 
the distal colon and mid small intestine. Similarly, each organ derives 
from multiple early embryonic progenitor cells. For example, prostate 
in mouse 2 is contributed to by both reconstructed cells in generation I 
(cells ‘b’ and ‘c’ in Fig. 1b) and by at least three of the four cells in gen- 
eration II (cells ‘d’, ‘e’ and ‘f in Fig. 1b). 

Mutations in individual cells can also provide insights into the number 
of cell divisions and the mutational processes that have been operative 
in normal tissues. To explore these questions, we extracted 25 sets of 
somatic substitutions (6,714 in total) unique to individual clones (with 
a true positive rate of 92%, as assessed by validation of 743 variants, 
anda mean sensitivity of 50% for detecting heterozygous substitutions 
(Supplementary Table 1)). Of the four tissues sampled, small-bowel 
stem cells have acquired significantly more base substitutions than any 
other tissue, and prostate and stomach the fewest (Fig. 3b; see Methods 
for an explanation of statistical test and P values). This remains true if 
the analysis is restricted to CT substitutions at CpG dinucleotides, 
which are predominantly due to a ubiquitous, intrinsic mutation pro- 
cess depending on the deamination of 5-methylcytosine to thymine’. 
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These differences could be due to different mutation rates per cell divi- 
sion in different tissues, different numbers of cell divisions in the lineage 
from fertilized egg to adult stem cell in different tissues, or both. Using 
other approaches, it has been estimated that the rate of small-bowel stem 
cell division is greater than that of cells from colon or stomach”, con- 
sistent with the mutation counts demonstrated here, suggesting that 
the number of somatic substitutions is acting as a ‘cell division clock’. 
The murine small-bowel Lgr5-positive stem cell has been reported to 
divide every 21.5h (ref. 21). Thus, the number of substitution muta- 
tions arising in each cell division in small-bowel stem cells is ~1.1 per 
cell division (Methods), similar to the rate of ~ 1.5 per cell division dur- 
ing early embryogenesis (Supplementary Discussion). The burden of 
mutations was higher in zones of repressed chromatin (q < 0.00001) 
and late-replicating regions (P = 0.000034), patterns previously reported 
for human cancer**”* (Extended Data Fig. 5). There was no statistically 
significant difference in mutation burden between the two mice. 
Comparison of the six subclasses of base substitution between the 
four mouse tissues sampled revealed differences in mutation patterns. 
In particular, the proportion of C—A substitutions in small-bowel stem 
cells was greater than in other adult tissues and in the 35 aggregated early 
embryonic mutations (Fig. 3a, c), which were predominantly CT tran- 
sitions enriched at CpG dinucleotides. Analysis by non-negative matrix 
factorization”, incorporating the immediate 5’ and 3’ sequence context 
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Figure 3 | The number and spectrum of substitution mutations in 
individual organoids. a, The spectrum of base substitution mutations in each 
organoid. b, Left: the absolute number of genome-wide substitutions. When 
adjusted for sensitivity, the range of the mutation burden per organoid genome 
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is 179-1,190 (mean 609) substitutions (see Supplementary Table 1). Right: the 
number of CpG->TpG substitutions. Grey bars, mouse 1; white bars, mouse 2. 
c, Mutational spectrum of embryonic variants. 
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of each mutated base, extracted two underlying mutational signatures. 
One was characterized predominantly by CT mutations (signature 2 
in Extended Data Fig. 6). The other had a major feature of C—A substi- 
tutions at XpCpT trinucleotides in addition to C+>T mutations (signature 
1 in Extended Data Fig. 6). Mutations acquired in vitro by small-bowel 
organoid stem cells had a different mutational signature. Sequencing 
of subclones of cell populations expanded from two small bowel single 
stem cells that had been exposed to a period of culture revealed a dis- 
tinct mutational signature characterized by T—G mutations enriched 
at XpTpT trinucleotides (Extended Data Fig. 7). The data indicate that 
multiple mutational processes are operative in normal cells in vivo and 
in vitro and that the degree of exposure differs across tissues. The mech- 
anism underlying the signature characterized by CA mutations is 
unclear. However, one possible cause is mutagenesis by reactive oxygen 
species, which have been reported to generate C>A mutations”. 

Catalogues of somatic mutations act as ‘archaeological’ records reflect- 
ing the past experiences of cancer cells”*. Our results illustrate the insights 
that they can provide into the life histories of normal cells. Sequencing 
of larger numbers of individual cells from a wider range of tissues will 
allow the precise reconstruction of cell lineages extending into later stages 
of embryogenesis with further insights into tissue-specific development, 
mutational processes and mutation rates. Studies of multiple animals 
will reveal variability between individuals and relationships with age, 
disease and environmental exposures. If somatic mutation rates in humans 
are similar to those in mice, the results indicate that application of this 
approach to human cells is a tractable endeavour. 


METHODS SUMMARY 


Two homozygous C57B16 male mice (mouse 1 and mouse 2) with Lgr5-ki, derived 
from lines established in 2006/2007, were euthanized at 116 and 98 weeks of age, 
respectively. Clonal organoid lines were grown from single clonal glands (stomach)'*"””’, 
single clonal crypts (small intestine’*”” and colon’*"”) or single cells (prostate; W.K. 
and H.C., manuscript in preparation). DNA from organoids and tail tissue from 
each mouse were whole-genome sequenced”*. Somatic single-nucleotide substitu- 
tions unique to each organoid were called using the CaVEMan algorithm*’, compar- 
ing each organoid with the tail of the same mouse. The precision of these mutations 
was 92%, as determined by massively parallel sequencing of PCR amplicons target- 
ing 743 (11%) randomly selected substitutions. Early embryonic mutations may be 
shared by multiple organoids and thus can be distinguished from mutations occur- 
ring later in development or after birth. However, they may also be present in the 
tail of the same mouse at significant read counts and thus be removed in compar- 
isons of organoids against the tail. To derive comprehensive catalogues of embry- 
onic mutations, we therefore compared each organoid against the tail of the other 
mouse. Every potential embryonic variant thus detected was subjected to valida- 
tion by capillary sequencing in every organoid and control tissue. We constructed 
lineage trees from mutations that were confirmed to be somatic and shared by at 
least two organoids. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Bidirectional switch of the valence associated with a 
hippocampal contextual memory engram 


Roger L. Redondo!*, Joshua Kim!*, Autumn L. Arons’?, Steve Ramirez’, Xu Liu’? & Susumu Tonegawa!* 


The valence of memories is malleable because of their intrinsic recon- 
structive property’. This property of memory has been used clinically 
to treat maladaptive behaviours”. However, the neuronal mechan- 
isms and brain circuits that enable the switching of the valence of 
memories remain largely unknown. Here we investigated these mech- 
anisms by applying the recently developed memory engram cell- 
manipulation technique**. We labelled with channelrhodopsin-2 
(ChR2) a population of cells in either the dorsal dentate gyrus (DG) 
of the hippocampus or the basolateral complex of the amygdala (BLA) 
that were specifically activated during contextual fear or reward con- 
ditioning. Both groups of fear-conditioned mice displayed aversive 
light-dependent responses in an optogenetic place avoidance test, 
whereas both DG- and BLA-labelled mice that underwent reward 
conditioning exhibited an appetitive response in an optogenetic place 
preference test. Next, in an attempt to reverse the valence of memory 
within a subject, mice whose DG or BLA engram had initially been 
labelled by contextual fear or reward conditioning were subjected 
to a second conditioning of the opposite valence while their original 
DG or BLA engram was reactivated by blue light. Subsequent opto- 
genetic place avoidance and preference tests revealed that although 
the DG-engram group displayed a response indicating a switch of 
the memory valence, the BLA-engram group did not. This switch 
was also evident at the cellular level by a change in functional con- 
nectivity between DG engram-bearing cells and BLA engram-bearing 
cells. Thus, we found that in the DG, the neurons carrying the mem- 
ory engram of a given neutral context have plasticity such that the 
valence of a conditioned response evoked by their reactivation can 
be reversed by re-associating this contextual memory engram with 
a new unconditioned stimulus of an opposite valence. Our present 
work provides new insight into the functional neural circuits under- 
lying the malleability of emotional memory. 

The amygdala can encode both negative and positive valence*'° and 
the DG encodes contextual information**””. It is unknown whether 
the DG drives expression of memories irrespective of the valence of the 
unconditioned stimulus (US). Therefore we investigated the roles of DG 
and BLA engrams in determining the valence of contextual memories 
and its possible switch. We targeted engram-bearing cells by infecting 
DGand BLA neurons of c-fos-tT A male mice with AA Vo virus express- 
ing, under the control of the tetracycline response element (TRE), ChR2 
and mCherry fusion protein (DG and BLA ChR2 mice, respectively) or 
mCherry-only (DG mCherry-only and BLA mCherry-only mice, respec- 
tively) (Fig. 1a, b and Methods). This method targets the expression of 
ChR2 to neurons in which the immediate early gene c-fos is activated 
during the encoding of a memory in the absence of the antibiotic doxy- 
cycline (dox) in the diet*’*. A similar proportion of neurons expressed 
ChR2 after encoding a fear memory (foot shock) or a reward memory 
(interaction with a female mouse in the home cage) (Fig. 1c). To test the 
capability of the ChR2-labelled neurons to drive an aversive or appeti- 
tive response, we developed two real-time optogenetic place memory tests: 
the optogenetic place avoidance (OptoPA) test for assessing aversive 


behaviour and the optogenetic place preference (OptoPP) test for assess- 
ing appetitive behaviour (see Methods and the characterization of these 
tests in Extended Data Fig. 1). 

On day 1 of the protocol, the mice underwent a habituation session 
while on doxycycline in the OptoPA or OptoPP test (Fig. 1d, f). During 
habituation, light activation had similar effects on both the ChR2- 
expressing mice and the mCherry-only controls (Fig. le, g) and was 
not sufficient to produce a change in preference (Extended Data Fig. 2). 
On day 3, off doxycycline, mice habituated to the OptoPA test were fear 
conditioned in context A (fear memory group), whereas the mice ha- 
bituated to the OptoPP test were reward conditioned, by spending 2h 
in context B with one female mouse (reward memory group). As a neg- 
ative control, groups of DG ChR2 or BLA ChR2 mice did not receive the 
US (foot shock in context A or female exposure in context B) on day 3 
(DG ChR2, no US on day 3 or BLA ChR2, no US on day 3 mice). At the 
end of day 3, all animals were returned to a doxycycline diet, closing the 
time window for labelling for the remainder of the experiment. On day 5, 
both DG ChR2 and BLA ChR2 mice of the fear memory group exhibited 
greater aversive responses in the OptoPA test than the corresponding 
mCherry-only and ChR2, no US on day 3 mice (Fig. le). Both DG ChR2 
and BLA ChR2 mice of the reward memory group showed greater appe- 
titive response than corresponding mCherry-only mice (Fig. 1g), DG 
ChR2, no US on day 3 mice or BLA ChR2, no US on day 3 mice in the 
OptoPP test (Fig. 1g). Therefore, the US is necessary on day 3 for the 
engram neurons to drive the appropriate response on day 5 tests. 

To investigate whether the valence of the memory associated with the 
DG or BLA engram can be reversed, we conducted within-subject lon- 
gitudinal experiments. For this purpose, both the fear and reward mem- 
ory groups were returned to a doxycycline diet immediately after day 3 
conditioning, preventing the expression of ChR2 by other neurons that 
may upregulate c-fos promoter-driven ChR2 after day 3. In a ‘fear-to- 
reward’ experiment (Fig. 2a, b), the fear memory group was subjected 
to an OptoPA test on day 5, and as expected, DG-ChR2 and BLA-ChR2 
mice exhibited aversive responses that were greater than DG mCherry- 
only mice and BLA mCherry-only mice, respectively (Fig. 2b). On day 7, 
these mice received light stimulation while interacting with 2 female 
mice in their home cage. This procedure on day 7 is hereafter referred 
to as ‘induction’. Another group of DG ChR2 mice received light stim- 
ulation but no female mice (DG ChR2, no US on day 7 mice). On day 9, 
the OptoPP test was used in the fear-to-reward experiment to test whether 
the neurons activated by the light stimulation during the induction pro- 
cedure could now drive an appetitive response. The DG ChR2 mice 
showed a greater appetitive response than the DG mCherry-only or DG 
ChR2, no US on day 7 mice (Fig. 2b). Light reactivation of the original 
fear memory engram-bearing cells labelled in the BLA (BLA ChR2 mice) 
failed to exhibit an appetitive response in the day 9 OptoPP test. For the 
fear-to-reward experiment, we introduced another protocol for DG ChR2 
and BLA ChR2 mice consisting of OptoPA tests on day 5 and day 9. The 
DG ChR2 mice displayed lower aversive responses on day 9 than the 
day 5 test, whereas the DG ChR2, no US on day 7 mice and the BLA ChR2 
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Figure 1 | Fear and reward engram reactivation, both in the DG and the 
BLA, drives place avoidance and place preference, respectively. a, b, c-fos- 
tTA mice were injected with AAVo-TRE-ChR2-mCherry or TRE-mCherry 
and implanted with optical fibres bilaterally targeting the DG (a) or the BLA 
(b). c, Similar engram labelling in the DG (t;3 = 0.42, NS (not significant)) and 
BLA (ty¢ = 0.35, NS) after fear (DG n = 16; BLA n = 14) and reward (DG 
n= 19; BLA n= 14) conditioning. d, Fear memory group experimental 
protocol. e, On day 1, difference scores (time in target zone during the on phase 
minus time in target zone during baseline phase) (Extended Data Fig. 1) 
were similar across all DG subgroups (F2,101 = 0.76, NS) and across all BLA 
subgroups (F,72 = 0.03, NS). On day 5, difference scores were lower in DG 
ChR2 (n = 48) and BLA ChR2 mice (n = 21) compared to the corresponding 
mCherry-only (DG n = 39; BLA n = 27) and DG or BLA ChR2, no US on 
day 3 mice (DG n = 17; BLA n = 27) (DG Fy 19; = 7.99, P< 0.001; BLA 

F.72 = 4.12, P< 0.05). f, Reward memory group experimental protocol. g, On 
day 1, difference scores were similar across all DG subgroups (F211; = 0.02, NS) 
and across all BLA subgroups (F233 = 0.04, NS). In the day 5 OptoPP test, 
difference scores were greater in DG ChR2 (n = 54) and BLA ChR2 mice 

(n = 35) compared to corresponding mCherry-only (DG n = 36; BLA n = 31) 
and DG or BLA ChR2, no US on day 3 mice (DG n = 24; BLA n = 21) 

(DG Fy11; = 9.76, P< 0.001; BLA Fy,g3 = 9.12, P< 0.001). Significance for 
multiple comparisons: *P < 0.05; **P < 0.01; ***P < 0.001. Results show 
mean + s.e.m. 


mice showed similar aversive responses on day 5 and day 9 (Fig. 2c), indi- 
cating that the DG ChR2 mice not only acquired reward memory by the 
induction procedure but also lost much of the fear memory acquired 
previously. In contrast, BLA ChR2 mice neither acquired a reward mem- 
ory nor lost the fear memory. 

In the reward-to-fear experiment (Fig. 2d, e), DG ChR2 and BLA 
ChR2 mice as well as the DG ChR2, no US on day 7 mice displayed 
greater appetitive responses in the OptoPP test on day 5 than the cor- 
responding mCherry-only mice (Fig. 2e). On day 7, these mice received 
light stimulation either while being subjected to fear conditioning in 
context A (DG ChR2 and BLA ChR2 mice) or while exploring context 
A without foot shocks (DG ChR2, no US on day 7 mice). DG ChR2 mice, 
but not DG ChR2, no US on day 7 mice displayed a greater aversive 
response in the day 9 OptoPA test than DG mCherry-only mice (Fig. 2e). 
Among several groups of mice, only in the DG ChR2 mice did the same 
neuronal ensemble whose reactivation led to an appetitive response 


LETTER 


__ #358} Im 
eS 85 Bp 


Induction 


Day 9 
* 
Fl 
Ff + 
B 304 11 oF ) 
oO oO 
5 -10 5 - 
8 il 20 8 10 
@ -20 8 -20 
5 10 5 
& -30 & -30 
[ay 0 [ay 
-40 -40 
-50 -10 | -50 
LSP SP PSY 
i DG chr2 DG mCherry-only IE DG Chr2, no US on day 7 
i BLAChR2 {)) BLA mCherry-only 
d y NN yp _, ,, 
On dox Off dox On dox Induction 
OptoPP OptoPP 
; Day 1 ; Day 3 Day 5 ! Day 7 Day 9 


e Day 5 ae Day 5 |OptoPP 
Day 9 |OptoPP 


407 ye 10 40 
tl *e 
a - ro 0 a es 
nan 
mm @ ay) 1 
-10 
20 | 
20 
10 
-30 
= O 10 
a 404 | 
¥* 
-10 -50 


0 
SP PP SP 


ayo [SORA] 


Difference score (s) 
a 
oO 
Difference score 


Figure 2 | The valence associated with the DG engram is reversed after 
induction with the unconditioned stimulus of opposite value. a, The fear-to- 
reward experimental protocol. b, On day 5, difference scores of DG ChR2 

(n = 16); DG ChR2, no US on day 7 (n = 20); and BLA ChR2 mice (n = 19) 
were lower compared to corresponding mCherry-only mice (DG n = 27; BLA 
n = 27) (DG Fy59 = 6.16, P< 0.01; BLA ty = 2.73, P< 0.01). In the day 9 
OptoPP test, difference scores of DG ChR2 mice were greater than the control 
mice (F)69 = 4.4, P< 0.05). Difference scores of BLA ChR2 mice were 
similar to those of BLA mCherry-only mice (t,4 = 0.16, NS). ¢, On the day 9 
OptoPA test, DG ChR2 mice (n = 12) showed a less aversive response 
compared to day 5, whereas both DG ChR2, no US on day 7 (m = 16) and BLA 
ChR2 (n = 17) mice showed similar difference scores on these days 

(Fy42 = 5.42, P< 0.05). d, Reward-to-fear experimental protocol. e, On day 5 
OptoPP test, difference scores of DG ChR2 (n = 17); DG ChR2, no US on 
day 7 (n = 29); and BLA ChR2 mice (n = 30) were greater compared to the 
corresponding mCherry-only mice (DG n = 27; BLA n = 29) (DG Fy, = 8.97, 
P<0.001; BLA ts7 = 2.85, P< 0.01). On day 9, difference scores of DG ChR2 
mice were lower compared to the control mice (F 7; = 3.20, P< 0.05) and 
difference scores of BLA ChR2 mice were similar to those of BLA mCherry- 
only mice (ts7 = 0.49, NS). f, On the day 9 OptoPP test, DG ChR2 mice (n = 18) 
showed a reduced appetitive response compared to day 5 OptoPP test, whereas 
both DG ChR2, no US on day 7 (n = 21) and BLA ChR2 mice (n = 18) 
showed similar difference scores on day 9 and day 5 (Fy,54 = 6.58, P< 0.05). 
Significance for multiple comparisons: *P < 0.05; **P < 0.01; ***P < 0.001. 
Results show mean + s.e.m. 
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during the OptoPP test on day 5 produced an aversive response on day 9. 
For the reward-to-fear experiment, we introduced another protocol for 
DG ChR2 and BLA ChR2 mice consisting of OptoPP tests on day 5 and 
day 9. The DG ChR2 mice displayed lower appetitive responses on 
day 9 than the day 5 OptoPP test, whereas the DG ChR2, no US on day 
7 mice and the BLA ChR2 mice showed similar appetitive responses on 
day 5 and day 9 (Fig. 2f), mirroring the results obtained by the fear-to- 
reward experiment described above (Fig. 2c). 

We next investigated the effect of our manipulations at the cellular 
level (Fig. 3a, b; see Methods for the experimental design details). Three 
groups were defined by whether, on day 3 of the protocol, mice experi- 
enced an event of the opposite valence without engram reactivation 
(light US"), induction (light * US*), or received optogenetic stimu- 
lation but no US delivery (light US”). On day 5, the effect of DG en- 
gram reactivation on the BLA was assessed. The proportions of cells 
(mCherry* / DAPI") (Fig. 3c, d) labelled on wee 1 as well as the pro- 
portion of cells activated on day 5 (GFP* / DAPI )(Fig. 3e, f) were sim- 
ilar across experimental groups in both the DG and BLA. The proportion 
of DG engram cells labelled on day 1 that were light-reactivated on day 5 
(GFP* mCherry* / mCherry~) was similar in all groups (Fig. 3g) and 
above chance (Fig. 3h). In the light” US* and the light* US” groups, 
the proportion of BLA engram cells labelled on day 1 that were reacti- 
vated on day 5 (GFP *mCherry* / mCherry") by artificial reactivation 
of DG engram cells were similar (Fig. 3i, k) and significantly greater 
than chance (Fig. 3j). In the light* US* group, this proportion, though 
greater than chance overlap (Fig. 3j), was significantly lower compared 
to the light” US* and the light” US” groups (Fig. 3i, k). This suggests 
that the induction procedure decreased the ability of the DG engram 
to activate the BLA engram, indicating a change in their functional 
connectivity'*">. 

Using the reward-to-fear scheme, we next investigated the effect of 
the light reactivation of a previously labelled (that is, day 3) reward mem- 
ory engram on subsequent encoding of a fear memory (that is, day 7) 
and whether this procedure affects the recall of the fear memory by 
natural cues (day 11) (Fig. 4a). On day 7, during induction, DG ChR2 
and BLA ChR2 mice displayed significantly lower freezing than DG 
mCherry-only and BLA mCherry-only mice, respectively (Fig. 4b). In 
addition, DG ChR2 and BLA ChR2 mice displayed significantly lower 
freezing compared to DG ChR2, no US on day 3 and BLA Ch2, no US 
on day 3 mice, respectively on day 7 and day 11 (Fig. 4b), indicating that 
the reduced encoding and recall of the fear memory in DG ChR2 and 
BLA ChR2 depends on the rewarding experience on day 3. Finally, we 
studied the integrity of the originally acquired memory after the induction 
of a memory of the opposite valence. For this purpose, in the fear-to- 
reward scheme, we investigated the effect of the reward memory induc- 
tion on day 7 on the original fear memory by testing the freezing response 
to the original context on day 11 (Fig. 4c). DG ChR2 mice, but not DG 
mCherry-only and DG ChR2, no US on day 7 mice, showed a significant 
reduction in their freezing response even though these mice were never 
re-exposed to the original context between encoding of the original fear 
memory on day 3 and testing on day 11 (Fig. 4d). In contrast, BLA ChR2, 
but not BLA mCherry-only mice, showed elevated freezing during the 
day 11 test session compared to the encoding session on day 3, consis- 
tent with earlier observations that BLA engram reactivation leads to the 
sensitization of fear responses'*””. 

During the context A test on day 11, DG ChR2 mice spent more time 
sniffing, a behavioural response that increases in the presence of female 
cues and decreases after fear training’*”’ (corrected for freezing peri- 
ods) (see Methods), than any other groups of mice (Fig. 4e), suggesting 
that our reward memory induction procedure increased the positive 
memory valence associated with the test context (that is, context A) on 
day 11. 

Here we have shown that both the DG and BLA neurons activated 
during context-specific fear or reward conditioning can drive aversive 
and appetitive responses, respectively, upon optogenetic reactivation 
of these cells two days after training. This confirms our previous finding 
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Figure 3 | DG to BLA functional connectivity changes after induction. 

a, Injection sites and the optic fibre placement. b, Experimental protocol. On 
day 1, cells active during fear or reward experience were labelled. On day 3, on 
doxycycline, mice were randomly assigned to 3 groups: reward or fear 
conditioning without light reactivation (light US*) (n = 11); full induction 
protocol (light * US*) (n= 16); light stimulation but neither reward nor fear 
conditioning (light US~) (n = 16). On day 5, all animals received light 
stimulation in the DG for 12 min in a novel context (context C) before the 
brains were used for immunohistochemistry. c, d, Similar proportions of 
neurons were labelled by the fear or reward conditioning on day 3 in all groups, 
both in DG (F),4 = 0.77, NS) (c) and BLA (F2,49 = 2.40, NS) (d). e, £, Light 
delivery to the DG on day 5 led to the activation (GFP* / DAPI") of similar 
proportions of cells in DG (F2,49 = 0.21, NS) (e) and BLA (F2,49 = 0.06, NS) (f). 
g, h, Levels of reactivation (GEP* mCherry* f mCherry~) in the DG were 
similar across all groups (F>,49 = 0.61, NS) (g) and above levels of chance 
(one sample t-tests against chance overlap: light” ,US* ty) = 4.24, P<0.01; 
light* US* 5 = 8.56, P< 0.001; Light” US™ t)5 = 5.5, P< 0.001) (h). 

i, j, Levels of reactivation (GEP* mCherry~ i mCherry* ) in the BLA 

were lower in the light US* compared to light” US" and light* US~ 

(F2,40 = 11.82, P< 0.001), even though overlap levels (j) remained above 
chance (one sample t-tests: light ~ US* tio = 7.41, P< 0.001; light* ust 

tis = 2.33, P< 0.05; light* US” tis = 6.94, P< 0.001). k, Representative 
images of double immunofluorescence for GFP (green) and mCherry (red) 
in the DG and BLA. Significance for multiple comparisons: *P < 0.05; 

***P < 0.001. Results show mean ~ s.e.m. 


that these neurons have undergone enduring changes as a consequence 
of memory training, validating their engram-bearing nature. We have 
also shown that artificially reactivating contextual fear-labelled neurons 
in the DG, but not in the BLA, during a subsequent reward conditioning 
was sufficient to reverse the dominant valence associated with the memory. 
Reciprocally, artificially reactivating contextual reward-labelled neurons 
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Figure 4 | Memory induction alters naturally cued fear memory. a, Reward- 
to-fear scheme with context A fear memory tests on day 7 and day 11. b, On 
day 7, freezing levels during the last 3 min of the induction procedure were 
reduced in DG ChR2 (n = 18) and BLA ChR2 mice (n = 19) compared to 
corresponding mCherry-only (DG n = 21; BLA n = 20) and ChR2, no US on 
day 3 mice (DG n = 27; BLA n = 24) (DG Fy63 = 8.768, P< 0.001; BLA 
F609 = 8.49, P< 0.001). These reduced freezing levels remained on day 11 (DG 
Fy 63 = 6.25, P<0.01; BLA Fy69 = 6.86, P< 0.01). c, Fear-to-reward scheme 
with context A fear memory tests on day 3 and day 11. d, Compared to the 
last three minutes of the fear conditioning on day 3, only DG ChR2 mice 

(n = 27) showed a reduction of freezing responses on day 11 (interaction 
F415; = 8.48, P< 0.001). BLA ChR2 mice (” = 29) showed increased levels of 
freezing on day 11 compared to day 3. DG ChR2, no US on day 7 mice (n = 42) 
and mCherry-only mice (DG n = 32; BLA n = 29) did not show differences 
between day 3 and day 11. e, After correcting for the time spent freezing, 

DG ChR2 mice spent a larger proportion of time sniffing than any other group 
in context A on day 11 (F415) = 7.78, P < 0.001). Significance for multiple 
comparisons: *P < 0.05; **P < 0.01; ***P < 0.001. Results show 

mean = s.e.m. 


in the DG, but not in the BLA, during a subsequent contextual fear con- 
ditioning was sufficient to reverse the dominant valence associated with 
the original memory (Fig. 2). We conclude that the valence associated 
with the hippocampal memory engram is bidirectionally reversible. In 
contrast, the inability of the BLA engram to reverse the valence of the 
memory suggests that individual BLA cells are hardwired to drive either 
fear or reward memories rather than both. 

The reversal of the dominant valence associated with the DG mem- 
ory engram was also demonstrated at the cellular level. The hippocam- 
pal output has been shown to be sufficient to induce synaptic plasticity 
in the amygdala”, and post-training inactivation of the dorsal hippo- 
campus prevents context-dependent neuronal activity in the amygdala”’. 
In addition, studies have shown that the BLA can drive both aversive” 
and appetitive” responses. We thus predicted that driving the DG 
engram associated with a particular valence would result in firing of 
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the corresponding BLA engram-bearing cells active during encoding— 
a hypothesis that was supported by the data in Fig. 3 (light US"). We 
hypothesized that optogenetic reactivation of the DG engram-bearing 
cells during the presentation ofa US with a valence opposite to the orig- 
inal one would strengthen the connectivity, albeit indirectly, of these 
DG cells with a new subset of BLA neurons while weakening the con- 
nections established during the original learning. This hypothesis is sup- 
ported by the finding that the overlap of BLA neurons activated by 
stimulation of the DG engram-bearing cells was reduced after induc- 
tion compared to the overlap observed in no induction controls (Fig. 3i, k). 
The observation that the levels of c-fos activation were similar across 
groups suggests that a new population of BLA neurons was function- 
ally recruited in the light" US* group. 

By applying optogenetic manipulations to two interacting brain areas, 
our study provides a new type of neural circuit analysis that elucidates 
functional relationships between brain areas with respect to the expression 
of memories. Previously, others have shown that neurons that artificially 
upregulate CREB” and express TRPV1" in the BLA can be associated 
with fear. Also, randomly labelled populations of neurons in the piriform 
cortex can drive opposing behaviours after their stimulation is paired 
with a US of positive or negative valence”. Here, because our engram 
labelling technology allows for the targeting of neurons that were acti- 
vated during natural memory encoding to express ChR2, we were able 
to label a specific memory trace and monitor the behavioural response 
elicited by natural cues (Fig. 4a, b). Our present study provides a new 
insight into the functional neural circuit underlying the malleability of 
emotional memory by highlighting the importance of the plasticity in 
the hippocampal-amygdala connections. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Exonuclease-mediated degradation of nascent RNA 
silences genes linked to severe malaria 


Qingfeng Zhang**, T. Nicolai Siegel*"*+, Rafael M. Martins*, Fei Wang’, Jun Cao‘, Qi Gao*, Xiu Cheng”, Lubin Jiang®, 
Chung-Chau Hon*®, Christine Scheidig-Benatar”’, Hiroshi Sakamoto”*, Louise Turner’, Anja T.R.J ensen’, Aurelie Claes”, 


Julien Guizetti??, Nicholas A. Malmquist?? & Artur Scherf?? 


Antigenic variation of the Plasmodium falciparum multicopy var 
gene family enables parasite evasion of immune destruction by host 
antibodies’. Expression of a particular varsubgroup, termed upsA, 
is linked to the obstruction of blood vessels in the brain and to the path- 
ogenesis of human cerebral malaria* °. The mechanism determining 
upsA activation remains unknown. Here we show that an entirely 
new type of gene silencing mechanism involving an exonuclease- 
mediated degradation of nascent RNA controls the silencing of genes 
linked to severe malaria. We identify a novel chromatin-associated 
exoribonuclease, termed PfRNase II, that controls the silencing of 
upsA var genes by marking their transcription start site and intron- 
promoter regions leading to short-lived cryptic RNA. Parasites carry- 
ing a deficient PfRNase IT gene produce full-length upsA var transcripts 
and intron-derived antisense long non-coding RNA. The presence of 
stable upsA var transcripts overcomes monoallelic expression, re- 
sulting in the simultaneous expression of both upsA and upsC type 
PfEMP1 proteins on the surface of individual infected red blood 


cells. In addition, we observe an inverse relationship between tran- 
script levels of PfRNase ITand upsA-type var genes in parasites from 
severe malaria patients, implying a crucial role of PfRNase II in se- 
vere malaria. Our results uncover a previously unknown type of post- 
transcriptional gene silencing mechanism in malaria parasites with 
repercussions for other organisms. Additionally, the identification 
of RNase II as a parasite protein controlling the expression of viru- 
lence genes involved in pathogenesis in patients with severe malaria 
may provide new strategies for reducing malaria mortality. 
Beyond histone modifying enzymes’ “°, additional post-transcriptional 
mechanisms may control antigenic variation of the P. falciparum multi- 
copy var gene family. RNA processing and degradation in P. falciparum 
erythrocytic-stage parasites has been inadequately studied, and there- 
fore its potential role in the post-transcriptional control of virulence genes 
is unknown. We examined the RNA exosome, a RNase complex involved 
in RNA processing and decay associated with RNA quality control in 
the nucleus and cytoplasm of eukaryotic cells'*’”. Bioinformatic analysis 
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predicts eight putative RNA exosome-associated proteins in the P. falcip- 
arum genome, including exoribonuclease functional domain-containing 
proteins Dis3 and Rrp6 (Extended Data Table 1) (PlasmoDB site at 
http://plasmodb.org). Here we identify a non-canonical exoribonuclease 
containing a putative RNase II domain with no further homology to 
known exosome subunits that regulates upsA var gene expression, des- 
ignated PfRNase II (PlasmoDB accession number PF3D7_0906000). 
Structural prediction revealed homology between the RNase II do- 
main of PfDis3 and human Dis3, and between PfRNase II and yeast 
Rrp44 (Extended Data Figs 1 and 2). By western blotting, antibodies 
against PfRNase II, PfDis3 and PfRrp6 reacted with bands of the pre- 
dicted molecular masses 214, 135 and 126 kDa, respectively (Fig. 1a). To 
investigate the association between PfRNase II and core exosome com- 
plex, we generated a PfRrp4—haemagglutinin (HA)-tagged core exosome 
member transfectant and produced an antibody against another core exo- 
some member, PfRrp45 (Extended Data Fig. 3a—d). Co-immunoprecipitation 


(CoIP) experiments showed that PfRNase II is a non-exosome exori- 
bonuclease, whereas PfRrp6 and PfDis3 are the catalytic subunits of 
the plasmodial RNA exosome (Fig. 1b). Immunofluorescence assay (IFA) 
analysis revealed that PfRrp6 and PfRNase II localize as distinct foci at 
the nuclear periphery and that PfDis3 is localized in the cytoplasm ad- 
jacent to the nucleus at the ring stage (Fig. 1c). Immunoelectron micro- 
scopy confirmed the nuclear localization of PfRNase II (Extended Data 
Fig. 3e, f). To assess the RNA processing activity of the putative RNase II- 
like domain, we produced recombinant RNase II domains of PfRNase II 
and PfDis3 as glutathione S-transferase (GST) fusion proteins, using a 
PfRNase II-dead mutant and GST alone as negative controls (Fig. 1d 
and Extended Data Fig. 3g). In vitro RNA degradation assays revealed 
processive 3'-5' hydrolytic activity towards single-stranded but not 
double-stranded RNA for PfRNase II and PfDis3, and no activity with 
the PfRNase II domain-dead mutant or GST alone (Fig. le and Extended 
Data Fig. 3h-j). 
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Figure 2 | Defect in PfRNase II leads to derepression of upsA var genes. 

a, Top: schematic diagram of PfRNase II-FKBP fusion protein. Bottom: growth 
curve of PfRNase II-FKBP clone H3 with or without Shld1. Parent 3D7-G7 was 
used as a control. Error bars represent s.e.m. for three independent 
experiments. b, Global transcriptome comparison of ring-stage parasite clones. 
Top: PfRNase II-FKBP Shld1* versus Shld1~. Bottom: PfRNase II-FKBP 
Shld1* versus wild-type 3D7-G7. Both axes are logarithmic and correspond to 
normalized read numbers of individual genes shown as dots. ¢, Statistical 
analysis of transcript level changes in PfRNase II-FKBP Shld1* over 
3D7-G7. The medians are indicated as a line, and whiskers illustrate the 
interquartile range. The cutoff (twofold) is indicated by a dashed line. Mito., 
mitochondrial. ***, P< 0.001; **, 0.001 < P<0.01; *, 0.01 < P< 0.05 (two- 
tailed Student’s t-test). d, Transcriptional patterns of var gene family in 
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ring-stage PfRNase II-FKBP-H3, G1, and C3 clones (Shld1* or Shld1) 
determined by qPCR. The five ups subtypes are indicated on the top. The data 
are shown as fold change of PfRNase II-FKBP over 3D7-G7 related to the seryl- 
tRNA synthetase gene as an internal control. e, Transcriptional profiles of 
intronic antisense long non-coding RNA (IncRNA) from three individual var 
genes with different ups-types in PfRNase II-FKBP clones compared with 3D7- 
G7 measured by qPCR. Data are represented as means = s.e.m. for three 
independent experiments. f, Northern blot assay of 3D7-G7 and PfRNase II- 
FKBP-H3 and G1 clones. The probe corresponding to each clone was C1 var, 
Al var and A2 var, respectively. The full-length transcript of each var gene is 
indicated by an arrow. Loading control for total RNA is shown by ethidium 
bromide staining. Data in a and f are representative of three independent 
experiments. 
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Attempts to disrupt PfRNase II in parasites by using conventional 
knockout strategies failed, indicating that this gene is essential for para- 
site growth (Extended Data Fig. 4a). We therefore engineered a ligand- 
regulated FK506-binding protein (FKBP) destabilization domain (DD) 
fusion protein to investigate PfRNase II protein knockdown in a para- 
site clone (3D7-G7) expressing a single upsC var gene, P£3D7_0412700) 
(Fig. 2a). The DD fusion protein is stabilized by the synthetic ligand Shld1 
and is rapidly degraded in its absence'*. Three independent PfRNase II- 
FKBP clones G1, H3 and C3 were obtained (Extended Data Fig. 4b, c), 
and showed comparable propagation rates during the first two eryth- 
rocytic cycles. Without Shld1, however, PfRNase II-FKBP protein levels 
were decreased and parasites revealed drastic morphological changes in 
mature stages of the second cycle and were deficient in progressing to a 
third cycle (Fig. 2a and Extended Data Fig. 4d). To study the transcrip- 
tional role of PfRNase I, we used RNA-seq analysis on total RNA from 
synchronized ring-stage parasites 48 h after the removal of Shld1 (Ex- 
tended Data Fig. 5a). Transcription patterns for the PfRNase II-FKBP 
clone H3 in the presence or absence of Shld1 were comparable except 
for 22 out of 37 mitochondrial genes with transcriptional upregulation 
(at least twofold) (Fig. 2b and Extended Data Fig. 5b), possibly explain- 
ing the observed lethal phenotype. Next, we compared transcriptional 
profiles of PfRNase II-FKBP parasites with those of wild-type parasites 
(3D7-G7). We observed more than fivefold upregulation in transcript 
levels for 213 genes in PfRNase II-FKBP in comparison with wild-type 
parasites (Fig. 2b, Extended Data Fig. 5c and Supplementary Table 1). 
This suggests that the carboxy-terminal FKBP tagging of PfRNase II 


LETTER 


disrupts interactions that are important for gene silencing or that it in- 
terferes with RNase II activity. Almost all transcriptional changes were 
observed for clonally variant virulence genes and non-coding RNA tran- 
scribed from regions adjacent to var genes. Strikingly, 14 upsA var genes 
are upregulated 5-fold to 270-fold, whereas most other subtypes (upsB 
and upsC) showed relatively minor changes. Moreover, two non-coding 
RNAs encoded by the 15-member family of GC-rich elements adjacent 
to internal var genes“ or from the subtelomeric non-coding TARE3 
region showed strong upregulation in ring-stage parasites (Fig. 2c and 
Extended Data Fig. 5d). These data indicate the existence of a new type 
of post-transcriptional gene silencing of virulence genes. 

To validate the observed transcriptional changes in severe malaria- 
associated upsA var genes, we performed real-time quantitative PCR 
(qPCR) for individual var genes in parasite clones. Despite an increase 
in upsA var-type transcript levels in PfRNase II-FKBP transgenic para- 
sites, only few members reached transcript levels observed during mono- 
allelic expression. Up to three distinct upsA members were found to be 
co-transcribed at very high levels together with the upsC var C1 (3D7_ 
0412700) expressed already in 3D7-G7 (Fig. 2d, f and Extended Data 
Fig. 6a). The upsA subgroup var genes are rarely activated in cultured 
parasites’>'*. Each analysed clone expressed a different combination 
of upsA members, suggesting that switching occurs frequently in this sub- 
group, whereas upsC var C1 expression remained stable. Control exper- 
iments (3D7 parasites grown with Shld1 or harbouring a non-integrated 
FKBP plasmid) revealed a single dominant upsC but no upsA var gene 
(Extended Data Fig. 6b). To further demonstrate dissociation of the 
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Figure 3 | Monoallelic expression of var gene family is controlled by 
chromatin-associated RNase II. a, Two-colour RNA-FISH (left) and 
statistical analysis of co-localization (right) of var transcripts in PfRNase II- 
FKBP-H3 and GI clones. Red, UpsC-type C1 var probe; green, upsA-type Al 
var, A2 var and control gene probes (fructose-bisphosphate aldolase (FBA), 
PF3D7_1444800; Glutaminyl-tRNA synthetase (1370): PF3D7_1331700). Co., 
co-localization; adj., adjacent; non-co., non-co-localization. ***, P< 0.001 ( va 
test). Nuclear DNA was stained by DAPI (blue). Scale bar, 1 jsm. Error bars 
represent s.e.m. for three independent experiments. b, Live-cell IFA using 
antisera to various PfEMP 1s to detect co-expression of var genes in PfRNase II- 
FKBP H3 and GI clones, respectively. The 3D7-G7 control expresses only one 
type of PfEMP1 (C1 var). Scale bar, 5 um. c, PfRNase I] is linked to the silent 


var-associated loci. Left: combined FISH/IFA assay using anti-PfRNase II 
antibody (IFA) with a FISH probe (PF3D7_0600400) that detected the upsA var 
expression site (RNA-FISH) in PfRNase II-FKBP clone H3. The same upsA var 
gene (DNA-FISH/IFA) was also analysed in ring-stage 3D7-G7 parasites. 
Right: statistical analysis of co-localization of each pair. In each analysis, more 
than 100 nuclei were counted; the error bars represent s.e.m. for three 
independent experiments. ***, P< 0.001 (7’ test). d, ChIP-qPCR of 
PfRNase II-HA transfectant. The enrichment of representative var genes with 
anti-HA antibody were shown respectively. The qPCR probes for each var 
gene are indicated by short lines under var gene loci. Error bars represent s.e.m. 
for three independent experiments. The data in b are representative of three 
independent experiments. TARE3, telomere-associated repeat element 3. 
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upsA var gene regulation, we cultured PfRNase II-FKBP parasite clones 
for a further 70 days (35 generations). qPCR analysis confirmed that 
most upsA var genes were upregulated in mutant clones but not in 
parent 3D7-G7 parasites, relative to day 0 (Extended Data Fig. 7a). For 
upsB and upsC var genes, some members showed higher transcript levels 
at day 70 but no striking differences between wild-type and mutant par- 
asites. Subclone analysis of the C3 bulk culture (70 days) showed that 
new combinations of upsA var members co-transcribed with different 
members of upsC (Extended Data Fig. 6a). These data further illustrate 
the PfRNase II-dependent loss of monoallelic expression and continual 
switching of upsA var genes. In the PfRNase II-FKBP clones, a more 
than 60-fold transcriptional upregulation of antisense long non-coding 
RNA was observed from the upsA var gene intron region in ring-stage 
parasites, confirming the RNA sequencing result (Fig. 2e and Extended 
Data Fig. 8) and further highlighting a previously suggested role for 
exon 1 antisense long non-coding RNA in the process of var gene 
expression’!”!8, 

Single-cell analysis using two-colour RNA-fluorescence in situ hybrid- 
ization (FISH) and surface immunofluorescence analysis confirmed the 
decoupling of upsA var genes from monoallelic expression in PfRNase II- 
FKBP parasites (Fig. 3a, b and Extended Data Fig. 7b). Using PfRNase II 
immunofluorescence with DNA-FISH (silent var) or RNA-FISH (active 
var A1), we observed a co-localization preference of PfRNase II protein 
to the silent var locus versus the active locus (Fig. 3c), indicating that 
removal of PfRNase II from the silent upsA var locus may trigger the gene 
activation in situ. We generated endogenous HA-tagged PfRNase II par- 
asites to investigate its chromatin association by chromatin immuno- 
precipitation (ChIP)-qPCR, and confirmed that the C-terminal HA tag 
of PfRNase II did not interfere with monoallelic var expression (Extended 
Data Fig. 9a-d). ChIP analysis showed higher enrichment of PfRNase II 
protein at promoter and intron regions of upsA var genes, significantly 
less enrichment at other var gene types, and no association with the 
active one (Fig. 3d and Extended Data Fig. 9e). Given the potential role 
of PfRNase II in messenger RNA degradation, we performed nascent 
RNA analysis at var loci to study whether unstable mRNA was produced 
that was absent from steady-state RNA in wild-type 3D7-G7. We observed 
15-20-fold higher levels of 5’ upsA var transcripts in nascent RNA than 
in total RNA. Non-upsA var genes and control genes showed slight, if 
any, difference between nascent and steady-state RNA levels (Fig. 4a), 
indicating cryptic mRNA production from upsA var genes. Further ana- 
lysis revealed that both upstream region and intron promoter-derived 
antisense long non-coding RNA were the main sources of cryptic RNAs 
produced by upsA var genes (Fig. 4b and Extended Data Fig. 7c). 

To investigate the clinical relevance of our findings, we extracted total 
P. falciparum RNA from blood from patients with severe malaria and 
from patients with uncomplicated malaria. As observed previously”, 
upsA transcript levels were higher in patients with severe malaria than 
in those with uncomplicated malaria (Extended Data Fig. 7d), but 
PfRNase II transcripts were inversely correlated with upsA transcript 
levels (Fig. 4c), thereby linking decreased PfRNase II expression to upsA 
var gene regulation in patients with severe malaria. 

Our findings identify a novel epigenetic silencing pathway in mal- 
aria parasites that is distinct from the post-transcriptional repression 
mechanism described previously”. Silenced upsA var genes continu- 
ally produce short-lived sense and antisense transcripts from promoter 
and intron regions as a result of rapid in situ degradation by a novel 
type of RNase II. This may explain the several var transcripts detected 
in single infected erythrocytes (ring stage) by PCR”'. By producing RNA 
from virtually all upsA var genes in RNase II-deficient parasites, sin- 
gular expression of var genes is abolished. This raises the possibility 
that antisense long non-coding RNA is involved in the var promoter 
activation process. Our data show a resemblance to the degradation of 
cryptic unstable nuclear RNA by the exosome observed in yeast””. How- 
ever, PfRNase II exerts its nuclear function dissociated from the RNA 
exosome core at specific chromatin regions (Extended Data Fig. 9f). 
We assume that the selective removal of PfRNase II at a specific upsA 
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Figure 4 | In situ degradation of nascent RNA at promoter regions leads to 
silencing of severe malaria-associated upsA-type var genes. a, Comparative 
qPCR var transcription analysis from nascent and steady-state mRNA in 
ring-stage 3D7-G7 wild-type parasites. Primers are designed against the 
upstream promoter regions of ups-specific or individual var and control 
genes. upsCs primers are silent upsC var-specific. The seryl-tRNA synthetase 
gene was used as an internal control for the calculation of relative copy 
numbers. Error bars represent s.e.m. for three independent experiments. 

b, Transcription profiles of nascent intronic antisense long non-coding RNA 
(IncRNA) from three individual var genes with different ups types from the 
steady-state RNA measured by qPCR assay. Transcription levels are shown 
as fold change of nascent RNA over total RNA. Error bars represent s.e.m. for 
three independent experiments. c, Transcription analysis of PfRNase II gene 
in field isolates with higher transcript levels of upsA var genes (from 

patients with severe malaria) and with lower levels (from patients with 
uncomplicated malaria). The LSA-1 and SSP2 genes with comparable 
transcriptional levels to those of PfRNase II are used as controls. **, P< 0.01 
(two-tailed Student’s t-test). 


var locus allows expression of that locus, a notion supported by our co- 
localization experiments (Fig. 3c). Upregulation of other types of RNA 
in PfRNase II mutant parasites indicates that this type of mechanism 
might be a wide-ranging gene expression control mechanism. 

In eukaryotic organisms, monoallelic expression controls distinct bio- 
logical processes. For example in mice, only one of the ~1,400 genes 
encoding olfactory receptors is expressed in any given olfactory sensory 
neuron”’. Similarly, protozoan parasites that cause malaria, giardiasis 
or sleeping sickness express one of many possible variant surface anti- 
gen genes at a time™*. This work identifies an entirely new mechanism 
contributing to monoallelic gene expression in malaria parasites that 
may be relevant in other organisms. In addition, the identification of 
the first Plasmodium protein controlling virulence factors expressed in 
patients with severe malaria may provide new strategies for reducing 
malaria pathogenesis. 


©2014 Macmillan Publishers Limited. All rights reserved 


METHODS SUMMARY 


The association of exosome-like exoribonucleases and the core exosome complex 
was investigated by CoIP assays. FKBP destabilization domain and triple HA tag- 
ging at the C terminus of PfRNase II gene were carried out using the single-crossover 
recombinant strategy. The PfRNase II-FKBP clones were used in the comparative 
transcriptome analysis with wild-type 3D7-G7 clone by single-stranded RNA se- 
quencing analysis”. The phenotypes of PfRNase II-FKBP clones were determined 
by RNA-FISH, IFA and fluorescence-activated cell sorting (FACS) with individual 
PfEMP1-specific antibodies in live infected red blood cells (iRBCs). The chromatin- 
associated PfRNase II was validated by electron microscopy, RNA-FISH, DNA- 
FISH and ChIP-qPCR analysis. We isolated the nascent RNA in a 3D7-G7 clone 
and used qPCR to measure the level of various regions of upsA var genes with other 
type var genes and housekeeping genes as controls, and compared them with those 
of steady-state RNA. All the primers used in this study are shown in Supplemen- 
tary Table 2. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Endocrinization of FGF1 produces a neomorphic and 


potent insulin sensitizer 
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Fibroblast growth factor 1 (FGF1) is an autocrine/paracrine regula- 
tor whose binding to heparan sulphate proteoglycans effectively pre- 
cludes its circulation’”. Although FGF1 is known as a mitogenic factor, 
FGF1 knockout mice develop insulin resistance when stressed by a 
high-fat diet, suggesting a potential role in nutrient homeostasis**. 
Here we show that parenteral delivery of a single dose of recombi- 
nant FGF1 (rFGF1) results in potent, insulin-dependent lowering of 
glucose levels in diabetic mice that is dose-dependent but does not lead 
to hypoglycaemia. Chronic pharmacological treatment with rFGF1 
increases insulin-dependent glucose uptake in skeletal muscle and 
suppresses the hepatic production of glucose to achieve whole-body 
insulin sensitization. The sustained glucose lowering and insulin sensi- 
tization attributed to rFGF1 are not accompanied by the side effects 
of weight gain, liver steatosis and bone loss associated with current 
insulin-sensitizing therapies. We also show that the glucose-lowering 
activity of FGF1 can be dissociated from its mitogenic activity and is 
mediated predominantly via FGF receptor 1 signalling. Thus we have 
uncovered an unexpected, neomorphic insulin-sensitizing action for 
exogenous non-mitogenic human FGF1 with therapeutic potential 
for the treatment of insulin resistance and type 2 diabetes. 

Increases in the prevalence of obesity and insulin resistance and the 
associated incidence of difficult-to-manage type 2 diabetes have become 
world-wide public health problems as well as a financial burden for the 
health care system, emphasizing the urgent need for improved insulin- 
sensitizing therapies. Thiazolidinediones are highly effective oral medica- 
tions for type 2 diabetes that act through the nuclear receptor peroxisome 
proliferator activated receptor y (PPARy) to control networks of genes 
involved in adipogenesis, lipid metabolism and insulin sensitization. 
However, the unique sensitization benefits of thiazolidinediones are com- 
promised by detrimental side effects, including weight gain, bone loss 
and congestive heart failure, suggesting that targeting downstream effec- 
tors of PPARy may evoke fewer side effects yet retain insulin-sensitizing 
potential®. In this regard, FGF21, a member of the endocrine FGF sub- 
family whose expression is regulated by PPAR; in adipose tissue, has been 
identified as an effective glucose-lowering agent in rodents*”. Recently, 
FGFI, the prototype of the FGF family of proteins, was also found to be 
transcriptionally regulated by PPAR; in adipose tissue, and Fgf1 knock- 
out mice exhibit an aggressive insulin-resistant phenotype when stressed 
by a high-fat diet**. The affinity of FGF1 for heparan sulphate proteo- 
glycans results in autocrine/paracrine signalling and limited serum expo- 
sure, distinguishing it from the endocrine FGFs, which include FGF21 
(refs 1, 2, 8, 9). These studies, combined with those demonstrating that 
FGF receptor agonist-antibodies modulate glucose homeostasis"° raised 
the question as to whether endocrinization of the non-endocrine FGF1 
could elicit glucose-lowering effects. 


To explore its therapeutic potential, recombinant murine FGF1 (rFGF1) 
was injected subcutaneously (subQ) into genetically induced (ob/ob and 
db/db) as well as diet-induced obese (DIO) insulin-resistant mice. We 
found that a single injection of 0.5 mg kg ' rFGF1 was sufficient to attain 
normoglycaemia in the severely hyperglycaemic ob/ob mice (Fig. 1a). 
Maximal glucose lowering was achieved within 18-24 h, and sustained 
effects observed for more than 48 h. Moreover, this effect was dose depen- 
dent, but even at the maximal dose (2.0 mg kg‘) it did not result in hypo- 
glycaemia (Fig. 1b and data not shown). Potent glucose lowering was 
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Figure 1 | Acute rFGF1 injection lowers glucose in diabetic mice. a, Blood 
glucose levels in ob/ob mice after rFGF1 (filled squares, n = 6) or control vehicle 
(open squares, m = 4) injection. b, Dose response of rFGF1 on blood glucose 
levels in ob/ob mice 24h after injection (n = 3). c—f, Blood glucose levels after 
rFGF1 injection in db/db (n = 3) (c), DIO (n = 6) (d), normal-chow-fed 

(n = 8) (e) and fasted ob/ob (n = 6 for control; n = 5 for rFGF1) (f) mice. 

g, Blood glucose levels in ob/ob mice before (open bars) and 24h after (filled 
bars) injection of murine FGF peptides (control, FGF1, FGF2: n = 4; FGF9, 
FGF10: n = 2). h, Blood glucose levels in ob/ob mice after human rFGF1 (filled 
bars, n = 4) or control vehicle (open bars, n = 4). C, control. Recombinant FGF 
peptides (0.5 mgkg *) or control vehicle (PBS) were injected subcutaneously 
to ad libitum fed mice unless otherwise noted. Values are means and s.e.m. 
Statistics by two-tailed t-test: *P < 0.05; **P < 0.01. 
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observed in both db/db and DIO mouse models, and rFGF1 was effec- 
tive when delivered either intraperitoneally or intravenously in ob/ob 
mice, independently of exogenous heparin (Fig. 1c, dand Extended Data 
Fig. la-c). rFGF1 had no effect on blood glucose or insulin levels in nor- 
moglycaemic chow-fed mice (Fig. le and Extended Data Fig. 1d, e). Con- 
sistent with the known effects of FGF1 on feeding'’”’, we observed a 
transient suppression of food intake that correlated with a temporary 
decrease in body weight (Extended Data Fig. 1f-i and data not shown). 
However, injection of rFGF1 similarly reduced glucose levels under fast- 
ing conditions, dissociating the glucose-lowering effects of rFGF1 from 
its effect on food intake (Fig. 1f). FGF1 is considered the universal ligand 
for FGF receptors (FGFRs) in its ability to bind and activate each of the 
alternatively spliced forms of the four tyrosine kinase FGF receptors 
(FGFR1-FGFR4), whereas other members of the FGF superfamily dem- 
onstrate receptor specificity’. To determine whether other autocrine/ 
paracrine FGFs have similar blood glucose lowering activity when given 
pharmacologically, we tested a selection of FGFs with specificities cov- 
ering all seven FGFR receptors. FGF1 seems unique in its ability to lower 
blood glucose: FGF2, FGF9 and FGF10 failed to do so (Fig. 1g). Further- 
more, recombinant human FGF1 (hFGF1) was similarly able to normalize 
blood glucose in diabetic mice, suggesting an evolutionarily conserved 
pathway (Fig. 1h). 

The pronounced glucose-lowering efficacy of a single injection of rFGF1 
led us to investigate the effects of serial injections. ob/ob mice injected 
with 0.5 mgkg “ every other day for 35 days (chronic treatment) did not 
develop any apparent resistance to rFGF1, exhibiting sustained glucose 
lowering with minimal changes in body weight or composition (Fig. 2a, b 
and Extended Data Fig. 2a, b). A similar glucose-lowering effect was 
also seen in pair-fed ob/ob mice, indicating that the transient reduction 
in food intake after chronic treatment with rFGF1 does not account for 
the beneficial glucose-lowering effects (Extended Data Fig. 2c, d). The 
fasting blood glucose levels of chronically rFGF1-treated ob/ob mice were 
50% lower than in PBS-treated control mice, and remained lower through- 
out the glucose tolerance test (GTT) with a coincident decrease in insulin 
levels. Furthermore, rFGF1-treated mice showed a marked improve- 
ment in insulin sensitivity as measured by an insulin tolerance test (ITT) 
(Fig. 2c-e). Although there was no significant effect on free fatty acids, 
cholesterol and triglyceride levels in serum (Extended Data Table 1 and 
Extended Data Fig. 2e, f), chronic treatment with rFGF1 decreased hepatic 
steatosis and increased liver glycogen content, as shown histologically 
and by quantitative analyses (Fig. 2f-h). No significant changes were 
detected in serum metabolic hormone levels (Extended Data Fig. 2g). 
Furthermore, chronic treatment with rFGF1 of DIO mice, a strain that 
more closely models the majority ofhuman type 2 diabetes, also resulted 
in pronounced and sustained lowering of blood glucose levels (Fig. 2i) as 
well as in increased insulin sensitization as measured by GTT and ITT 
(Fig. 2), k). These beneficial effects in DIO mice were observed without 
significant changes in body weight, organ weights and feeding trends 
(Extended Data Fig. 2h-j). 

The potent and rapid normalization of blood glucose by parenteral 
rFGF1 led us to investigate potential insulin secretagogue or insulin- 
mimetic activities of rFGF1. rFGF1 administered by injection had no 
significant effect on glucose-stimulated insulin secretion in isolated pan- 
creatic islets and did not increase serum insulin levels under basal condi- 
tions or during a GTT, indicating that exogenous FGF1 does not stimulate 
pancreatic B-cell insulin release, either ex vivo or in vivo (Fig. 2d and 
Extended Data Fig. 3a-c). Next, we tested rFGF1 efficacy in mice rendered 
diabetic by streptozotocin-induced destruction of insulin-producing 
B-cells. In this diabetic mouse model (STZ mice), rFGF1 alone failed to 
lower blood glucose levels, indicating that rFGF1 is not an insulin mimetic 
(Fig. 3a). However, pretreatment of STZ mice with rFGF1 markedly 
enhanced the glucose-lowering effects of exogenously supplied insulin 
(Fig. 3b, c). Conversely, the rFGF1-dependent increase in metabolic clear- 
ance rate was severely blunted when insulin secretion was inhibited by 
somatostatin (Fig. 3d-g). Improved sensitivity to insulin often correlates 
with reduced systemic inflammation; indeed, chronic treatment of ob/ob 
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Figure 2 | Chronic administration of rFGF1 achieves sustained glucose 
lowering and insulin sensitization in diabetic mice. a, b, Random-fed blood 
glucose (a) and body weight (b) during chronic treatment of ob/ob mice with 
rFGF1. c-e, GTT (c), insulin levels during GTT (d) and ITT (e) measured in 
ob/ob mice after 4 weeks of rFGF1 treatment (control n = 6, rFGF1 n = 8). 
f-h, Representative haematoxylin and eosin (H&E) and periodic acid-Schiff 
staining (PAS; magenta represents glycogen) (f), and triglyceride (g) and 
glycogen (h) content of ob/ob livers after 5 weeks of chronic treatment with 
rFGF1 (n = 6 for overnight fasted control; n = 8 for rFGF1). i-k, Random-fed 
blood glucose (i), GTT (j) and ITT (k) in DIO mice after 3 weeks rFGF1 
treatment (n = 6). Ad libitum fed mice were injected subcutaneously with 
0.5mgkg | rFGF1 (filled bars and symbols) or control vehicle (PBS, open bars 
and symbols) every 48h. Values are means and s.e.m. Statistics by two-tailed 
t-test: *P < 0.05; **P < 0.01; ***P < 0.005. Scale bar, 200 tm. 


mice with rFGF1 decreased serum levels of several inflammatory cytokines 
(eotaxin, keratinocyte chemoattractant (KC), Mip-1b and interleukin-3) 
(Extended Data Fig. 3d). These results demonstrate that rFGF1 operates 
in an insulin-dependent manner to lower blood glucose, and suggest that 
parenteral rFGF1 may act as an insulin sensitizer. 

To investigate the physiological mechanisms of rFGF1 action further, 
we performed hyperinsulinaemic—euglycaemic clamp studies in DIO mice 
chronically treated with vehicle or rFGF1. The steady-state glucose infu- 
sion rate during the clamp was ~75% higher in rFGF1-injected mice, 
indicating increased responsiveness to insulin (Extended Data Fig. 4a). 
The ability of insulin to suppress hepatic glucose production was improved 
in rFGF1-injected mice, revealing increased hepatic insulin sensitivity 
as a long-term consequence of rFGF1 injection (Fig. 4a). Liver gene expres- 
sion analyses from chronically rFGF1-treated DIO mice revealed signi- 
ficant reductions in macrophage markers, for example F4/80 and Cd1lIc, 
and in inflammatory cytokines, for example I/-1«, II-1f and Tnf- (Ex- 
tended Data Fig. 4b). Furthermore, whole-body and insulin-stimulated 
glucose disposal rates were ~47% and ~80% higher, respectively, in 
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Figure 3 | rFGF1 induces insulin-dependent glucose uptake. a, Blood 
glucose levels in STZ-induced diabetic mice 8 h after subcutaneous injection of 
control PBS (open bars) or rFGF1 (0.5 mgkg’, filled bars). b, c, ITT (b) and 
area under the ITT curve (AUC; c) for control and rFGF1-treated STZ mice 
(open and filled squares and bars, respectively; n = 4). d—g, Blood glucose level 
(d, e) and metabolic clearance rate (MCR; f, g) in ob/ob mice after control PBS 
(open bars) or rFGF1 injection (0.2 mgkg ' intravenously, filled bars) 

before (d, f) or 1h after (e, g) infusion of somatostatin (SOM) (n = 5). Values 
are means and s.d. Statistics by two-tailed t-test: *P < 0.05; **P < 0.01. 


rFGF1-injected mice, reflecting enhanced peripheral sensitivity to insu- 
lin (Fig. 4b, cand Extended Data Fig. 4c-f). Consistent with the hepatic 
and peripheral insulin-sensitizing effects of rFGF1, insulin-stimulated 
AKT signalling was enhanced in both the liver and muscle of chronically 
rFGF1-treated DIO mice (Extended Data Fig. 4g, h). Taken together, 
these findings demonstrate that chronic administration of rFGF1 leads 
to sustained glucose lowering and whole-body insulin sensitization. 
The above studies demonstrating robust and sustained glucose low- 
ering in multiple diabetic mouse models raised the possibility of paren- 
teral administration of rFGF1 as a diabetic therapy. In support of this 
notion, diabetic mice chronically treated with rFGF1 were phenotypically 
normal in terms of locomotor activity, oxygen consumption and respi- 
ratory exchange ratio (Extended Data Fig. 4i-n), suggesting that chronic 
treatment with rFGF1 does not evoke adverse pleiotropic effects. Although 
endogenous FGF1 has been shown to have a role in adipose remodelling, 
histological examination of this tissue found no abnormalities after chronic 
treatment with rFGF1 (Extended Data Fig. 40). Furthermore, serum crea- 
tine kinase levels did not change in chronically rFGF1-treated ob/ob mice, 
indicating the absence of muscle tissue damage (Extended Data Fig. 4p). 
Although thiazolidinediones are the only therapeutic insulin sensiti- 
zers, their application is limited by adverse side effects, including weight 
gain, increased liver steatosis and bone fractures’. Notably, chronic treat- 
ment with rFGF1 did not lead to weight gain, and there was a decrease 
in hepatic steatosis (Fig. 2f, Extended Data Fig. 2i, ) and Extended Data 
Table 1). Similar to thiazolidinediones, endocrine FGF21, which has 
recently been shown to have potential therapeutic blood glucose-lowering 
effects, has also been associated with a loss of bone density’*. In contrast, 
bone mineral density, trabecular bone architecture and cortical bone thick- 
ness were not affected by chronic treatment with rFGF1 in DIO mice, as 
determined by micro-computed tomography analyses (Extended Data 
Fig. 4q, r). Furthermore, total and high-molecular-weight serum adipo- 
nectin levels were not altered by chronic treatment with rFGF1, differ- 
entiating its mechanism of action from the adiponectin-dependent 
glucose-lowering effects of FGF21 (refs 15, 16) (Extended Data Fig. 4s). 
As the prototype of a growth factor family, rFGF1 has the potential to 
induce unwanted cell proliferation, and concern resides in whether the 
mitogen properties of rFGF1 could be dissociated from its glucose- 
lowering activities. To address this question we generated an FGF1 ligand, 
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Figure 4 | Chronic administration of rFGF1 is insulin sensitizing. 

a-c, Insulin-stimulated suppression of hepatic glucose production (HGP) 

(a), steady-state glucose disposal rate (b) and insulin-stimulated glucose 
disposal rate (c) measured during hyperinsulinaemic-euglycaemic clamps on 
DIO mice after 3 weeks of control PBS (Veh., open bars, n = 11) or rFGF1 
(0.5 mgkg ' subcutaneously every other day; filled bars, n = 9) treatment. 

d, Proliferative activity (measured by the absorbance of formazan at 600 nm) of 
NIH3T3 cells treated with rFGF1 (red) or rFGFIANT (blue) at the indicated 
concentrations (experiment repeated three times). e, Blood glucose levels of 
ob/ob mice treated with control PBS (open symbols, n = 12), rFGF1 

(0.5mgkg ' subcutaneously; filled symbols, n = 8) or fFGF1ANT (0.5 mgkg* 
subcutaneously; red dashed line, n = 6). f, Blood glucose levels of DIO mice 
treated with control PBS (open bars), rFGF1 (0.5 mgkg * subcutaneously, 
filled bars), or rFGF14N" (0.5 mgkg' subcutaneously, striped bars) at the 
indicated times (n = 10). g, Blood glucose levels in 8-month-old mice fed on a 
high-fat diet: Fefr iE" “ (WT, n= 5) and aP2-Cre;Fefr “ (R1 knockout (KO), 
n = 4) mice at 0h (open bars) and 24h (filled bars) after treatment with rFGF1 
(0.5 mgkg~' subcutaneously). Values are means and s.e.m. Statistics by 
two-tailed t-test: *P < 0.05; **P < 0.01; ***P < 0.005; n.s., not significant. 


rFGFIONT, lacking the first 24 residues from the amino terminus. On 
the basis of the crystal structures of FGF1-FGFR complexes, the trunca- 
tion was predicted to have significant effects on the binding affinity of 
FGF] for all FGFRs, and similar truncations have been shown to decrease 
FGF1 mitogenicity. Consistent with this prediction, rFGF1“' showed 
a marked decrease in binding affinity for FGFRs compared with the native 
ligand, yet was still able to bind FGFR1c and FGFR2c, although with lower 
affinity (Extended Data Fig. 5a, b). In vitro, rFGF 14NT showed a some- 
what attenuated ability to activate intracellular signals downstream of 
FGFRs but showed a severe decrease in mitogenic activity (Fig. 4d and 
Extended Data Fig. 6a). Parenteral delivery of rFGF1““" dose-dependently 
lowered blood glucose levels in both genetically induced and diet-induced 
mouse models of diabetes (Fig. 4e, fand Extended Data Fig. 6b). rFGF1“N7 
also retained the feeding suppression effects observed with rFGF1 (Ex- 
tended Data Fig. 6c). The synthetic effects of exogenous rFGF1 on phys- 
iology, such as glucose homeostasis and feeding behaviour, therefore 
differ from, and are independent of, its classical role as a growth factor 
and mitogen. 

In exploring the receptor dependence of the observed glucose-lowering 
effects, we speculated that the effects were mediated through FGFR1, 
on the basis ofits known role in insulin sensitivity’® and the observation 
that rFGF14NT retained FGFR1c-binding affinity in our surface plasmon 
resonance studies. Furthermore, our previous studies on Fgf1 knockout 
mice implicated adipose tissue as a major site of FGF1 action. Accord- 
ingly, we generated mice lacking Fgfrl predominantly in adipose tissue 
(aP2-Cre;Fefr!" if ,R1 knockout mice). Indeed, although rFGF1 potently 
lowered blood glucose levels in control diabetic mice, it failed to lower 
glucose levels in the R1 knockout mice (Fig. 4g), indicating a requirement 
for FGFR1. Consistent with this, rFGF1 A“ similarly failed to affect blood 
glucose levels in R1 knockout mice (Extended Data Fig. 6d). An FGF1 
analogue with severely attenuated FGFR-mediated signalling in vitro 
(FGF1 L29-D155; rFGF14N™) did not significantly affect blood glu- 
cose levels in diabetic mice (Extended Data Fig. 6e, f), further supporting 
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the notion that FGFR1-mediated signalling is required for the glucose- 
lowering effects of parenteral rFGF1. Taken together, these studies iden- 
tify exogenous or endocrinized rFGF1 as a potent and long-lasting insulin 
sensitizer that seems to circumvent the adverse side effects associated 
with other diabetic therapies. 

Previously we identified a role for endogenous FGF1 in the adaptive 
remodelling of visceral adipose tissue in response to nutrient fluctua- 
tions. The profound metabolic dysregulation observed in FGF1 knock- 
out mice when stressed by a high-fat diet was associated with decreased 
vascularity in visceral adipose depots, consistent with the known role of 
FGF1 in angiogenesis. In contrast, our present findings identify a neo- 
morphic insulin-sensitizing action for FGF1 in which systemic delivery 
of the normally autocrine/paracrine FGF1 through parenteral routes 
results in potent and sustained correction of hyperglycaemia accompa- 
nied by whole-body insulin sensitization. 

The apparent divergent activities in vivo now ascribed to FGF1 seem 
to closely parallel those of FGF21, which has been assigned locally restricted 
roles in adipose tissue as well as systemic glucose-lowering activities. How- 
ever, FGF21 circulates as a true endocrine hormone, whereas the high- 
affinity heparan sulphate proteoglycan-binding activity and serum lability 
of FGF1 restrict its endogenous actions to local tissues, resulting in the 
rapid clearance of exogenous FGF1 from the circulation’. In addition, 
these two FGFs have disparate FGFR specificities: whereas FGF1 can bind 
and signal through each of the alternatively spliced forms of the FGFRs, 
FGF21 signalling requires a heterodimeric B-klotho-FGFR complex’. 
Our findings indicate that the metabolic effects of exogenous FGF1 are 
mediated through FGFR1 in adipose tissue; however, additional studies 
will be necessary to exclude the involvement of additional receptors and/ 
or tissues in the body-wide effects attributed to parenterally delivered 
rFGF1. In conclusion, given that the glucose-lowering effects of rFGF1 
were not accompanied by side effects associated with current insulin 
sensitizers, along with the discovery that non-mitogenic rFGF1“" retains 
glucose-normalizing capability, rFGF1 and its derivatives may hold ther- 
apeutic promise. 

Online Content Methods, along with any additional Extended Data display items 


and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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Eukaryotic cells coordinately control anabolic and catabolic processes 
to maintain cell and tissue homeostasis. Mechanistic target of rapa- 
mycin complex 1 (mTORC1) promotes nutrient-consuming anabolic 
processes, such as protein synthesis’. Here we show that as well as 
increasing protein synthesis, mTORC] activation in mouse and human 
cells also promotes an increased capacity for protein degradation. 
Cells with activated mTORCI exhibited elevated levels of intact and 
active proteasomes through a global increase in the expression of 
genes encoding proteasome subunits. The increase in proteasome 
gene expression, cellular proteasome content, and rates of protein 
turnover downstream of mTORCI were all dependent on induction 
of the transcription factor nuclear factor erythroid-derived 2-related 
factor 1 (NRF1; also known as NFE2L1). Genetic activation of mTORC1 
through loss of the tuberous sclerosis complex tumour suppressors, 
TSC1 or TSC2, or physiological activation of mTORCI1 in response 
to growth factors or feeding resulted in increased NRF1 expression 
in cells and tissues. We find that this NRF1-dependent elevation in 
proteasome levels serves to increase the intracellular pool of amino 
acids, which thereby influences rates of new protein synthesis. Therefore, 
mTORC1 signalling increases the efficiency of proteasome-mediated 
protein degradation for both quality control and as a mechanism to 
supply substrate for sustained protein synthesis. 

In response to growth signals, mTORC1 promotes anabolic processes, 
such as protein synthesis, and its chronic activation is believed to under- 
lie a variety of complex human diseases, including cancer and metabolic 
diseases, as well as pathologies associated with ageing’. Cells must pos- 
sess mechanisms to coordinate protein synthesis with protein turnover 
to maintain amino acid and protein homeostasis, as even a small per- 
sistent imbalance between these processes can disrupt cell and tissue 
physiology**. Given the ubiquitous role of mTORC1 in stimulating pro- 
tein synthesis, we sought to assess the effects of mTORC1 activation on 
protein degradation. 

A protein complex comprising TSC1, TSC2 and TBC1D7 (the TSC 
complex) serves as a central negative regulator of mTORCI, with loss 
ofits components resulting in growth-factor-independent activation of 
mTORCI (ref. 1). Tsc2~/~ mouse embryonic fibroblasts (MEFs) exhibit 
growth-factor-independent activation of mTORCI, as scored by phos- 
phorylation of S6K1 and S6 (Extended Data Fig. 1a), which is blocked 
by the mTORC1 inhibitor rapamycin or reconstitution with human 
TSC2. Both TSC1- and TSC2-deficient MEFs displayed a 20-25% in- 
crease in the rate of de novo protein synthesis, which was abolished by 
rapamycin (Fig. laand Extended Data Fig. 1b). To analyse relative turn- 
over rates of newly synthesized proteins, the percentage of total labelled 
protein remaining over time was measured in a pulse-chase experiment 
(Extended Data Fig. 1c). Both TSC1- and TSC2-deficient cells displayed 
a rapamycin-sensitive increase in the rate of protein degradation (Fig. 1b 
and Extended Data Fig. 1d-f). This was surprising given the well-established 
role of mTORC1 in inhibiting autophagy, a lysosome-dependent mech- 
anism of degrading organelles and proteins*. While the lysosome inhib- 
itor chloroquine slowed the rate of protein degradation, cells lacking 
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TSC2 maintained a rapamycin-sensitive increase in protein turnover, 
and rapamycin also slowed rates of protein degradation in autophagy- 
deficient (Atg7‘~)° MEFs (Fig. 1c and Extended Data Fig. 2a-c). 
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Figure 1 | mTORC1 enhances protein degradation through an increase in 
proteasome levels. a, Tsc2 ‘~ MEFs expressing TSC2 or empty vector (Vec) 
were serum starved for 16h with vehicle or 20 nM rapamycin (Rap), and 
protein synthesis was measured with *°S-Met incorporation (20 min). Data are 
mean + standard error of the mean (s.e.m.) (1 = 3). **P < 0.01, +P < 0.05. 
b, Cells treated as in a were pulse labelled for 30 min and chased in medium 
containing vehicle or rapamycin. The rate of protein degradation is shown as 
the fraction of radiolabelled protein remaining over time. c, d, Cells were treated 
as in b, except that 10 1M chloroquine (c) or 0.02 1M bortezomib (d) was 
present in the chase media. b-d, Data are mean + s.e.m. (n = 3; note: small 
error bars are masked by line symbols). *P < 0.05, **P < 0.01, at 48h. e, The 
cells from a were serum starved for 16h with vehicle (Veh) or 20nM 
rapamycin, or treated for 2h with MG132 (MG; 0.1 1M) or bortezomib (Bort; 
0.2 uM). Proteasome activity is presented as mean + s.e.m. relative to vehicle- 
treated cells expressing TSC2 (n = 3). *P< 0.05, {P< 0.01. f, Cells were 
serum starved for 24h with vehicle (V), 20 nM rapamycin (R) or 2.5 uM PP242 
(P). Intact proteasome levels are presented as mean + s.e.m. relative to vehicle- 
treated TSC2-expressing cells (n = 3). Graphs are labelled as in e; *P < 0.05, 
+P <0.05, TTP < 0.01. a-e, Statistical significance for pairwise comparisons 
evaluated with a two-tailed Student’s t-test. 
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A role for the proteasome as the other major cellular mechanism of 
proteolysis was examined. As with rapamycin, two structurally distinct 
proteasome inhibitors, bortezomib and MG132, blocked the enhanced 
rate of protein degradation in TSC2-deficient cells (Fig. 1dand Extended 
Data Fig. 2d-f). TSC2-deficient cells possessed significantly higher pro- 
teasome activity relative to TSC2-expressing cells, which was attenuated 
by rapamycin and abolished with proteasome inhibitors (Fig. le). The 
levels of intact proteasomes were measured in four distinct isogenic pairs 
of cells, including Tsc2-’~ MEFs anda TSC2‘~ human angiomyolipoma- 
derived cell line expressing either empty vector or human TSC2, and 
HeLa and MCFIOA cells stably expressing control or TSC2-targeting 
short hairpin (sh)RNAs. TSC2 loss and mTORC1 activation resulted 
in a significant increase in cellular proteasome content in all four lines, 
which was reversed by rapamycin or the mTOR kinase inhibitor PP242 
(Fig. 1f and Extended Data Fig. 2g, h). 

To determine the mechanism of proteasome increase downstream 
of mTORCI, we focused on transcriptional regulation, as induced ex- 
pression of the genes encoding proteasome (PSM) subunits has been es- 
tablished as a major mechanism controlling cellular proteasome content”®. 
Interestingly, in a previous transcriptional profiling study, gene set en- 
richment analysis of mTORC1-induced transcripts found that the most 
enriched gene set was ‘Parkin disorder under Parkinson disease”. In 
examining the genes driving this enrichment score, we found that ten 
PSM genes scored as being significantly stimulated by mTORCI1 (Ex- 
tended Data Fig. 3a). We confirmed that mTORC1 activation induced 
the expression of messenger RNAs encoding subunits of both the 20S 
core and 19S regulatory complex of the proteasome in two independent 
sets of Tsc2-null MEFs and in HeLa cells with stable knockdown of TSC2, 
as well as wild-type MEFs stimulated with serum (Fig. 2a and Extended 
Data Fig. 3b-e). NRF1 has been demonstrated to induce the global ex- 
pression of PSM genes through direct binding of shared regulatory ele- 
ments in their promoters’”*. Importantly, short interfering (si)RNA 
knockdown of NRF1, but not the closely related NRF2 (also known as 
NFE2L2), blocked the mTORC1-dependent induction of PSM genes and 
led to a decrease in intact proteasome levels in multiple TSC2-deficient 


a b 
Tsc2- MEFs | PSMA (208) 
198 2.0 
+TSC2 5 * F 
7 2 ae 4 Tsc2-- MEFs siRNA: 
+Vec got tt « 
20s ~ t HB «tsc2 NRF1 
® 4 +Vec NRF2 
g 
2 05 ; 
[3s 3 Tubulin 
© oO 
VRP VRP 
PSMB1 (208) PSMB2 (208) PSMA1 
= 20 c 2.54 =< 25 
Ss se S * Ss pas 
Bae Q 2.0 = % 20 x 
£ + S45 5 15 
Qa Gls . 
5 1.0 it 3 tet 3 * 
® o 1.0 o 1.0 
= = r4 ao 
g 05 S05 B05 
o Oo Oo 
co co co 
VRP VRP VRP VRP ore 6 12 
PSMB5 (208) PSMD1 (198) PSMD1 
= 3.0 S < 4.0 = 35 
3 = q + B 3.0 = 
8 50 g 3.04 @ 25 
Qa 7 Qa 
g u 8 20 t ee 
2 1.0 Tr g tt g 40. 
z = 10 € 1 
is is asf 
pie a 
= 0 = 0 =O 
VRP VRP VRP VRP C12 C12 


0 
{-5 
x 
m 
Az 
ie 
8 
© 


HB :tsc2 [_]+Vvec ag 20 
o 
MEFs Angiomyolipoma Qe 154 x 3s ° 
* Pa £2 
_ 15 2.0 t QE 49 Be 
3 3 - se Ee 
2s Qe 15 22 os Be 
fe 10 Ro mio, ce 
$3 25 10 ~ 0 33 
58 05 88 Vee+ + --- - Bf 
52 22 05 = 
co co * His-NRF1-Flag - - + + - - 
a 5 a 0 NRF1-Myc - - - - + + 
© 1 oq ote C12 Rap = 4 = 4 = 


LETTER 


cell lines (Fig. 2b, c and Extended Data Fig. 4a—d). Reciprocally, exo- 
genous overexpression of two different Nrf1 complementary DNA con- 
structs led to elevated levels of intact proteasomes, which, unlike control 
cells, were resistant to the effects of rapamycin (Fig. 2d and Extended 
Data Fig. 4e). Like rapamycin, NRF1 knockdown blocked the enhanced 
rate of protein turnover in TSC2-deficient cells (Fig. 2e and Extended 
Data Fig. 4f, g). Collectively, these data indicate that NRF1 functions 
downstream of mTORC1 in promoting proteasome-mediated protein 
degradation. 

A rapamycin-sensitive increase in NRF1, but not in NRF2, protein 
levels was observed in multiple mouse and human cell lines lacking 
TSC2, as well as in HEK293 cells overexpressing RHEB, the downstream 
target of the TSC complex that activates mTORC1 (Fig. 2b and Extended 
Data Figs 4b, c, fand 5a-c)’. In wild-type cells, mTORC1 signalling is 
dependent on growth factors, and NRF1, but not NRF2, was upregulated 
in a rapamycin-sensitive manner over a time course of growth factor 
stimulation (serum, epidermal growth factor (EGF) or insulin) in wild- 
type MEFs, MCF10A and HeLa cells (Fig. 3a and Extended Data Fig. 5b, c). 
The mTORC1-mediated induction of NRFI1, through either growth 
factors or TSC2 loss, was reflected in an increase in both the unprocessed 
(p120) and processed (p110) isoforms of NRF1 (ref. 10), as seen most 
clearly on a NuPage gradient gel (Extended Data Fig. 5d). Rapamycin 
also decreased the protein levels of individual proteasome subunits, as 
well as an insulin-stimulated increase in intact 26S proteasomes (Extended 
Data Fig. 5e, f). 

Chronic mTORCI signalling can cause endoplasmic reticulum (ER) 
stress and activate the unfolded protein response (UPR)"’, an adaptive re- 
sponse that includes increased proteasomal degradation of ER proteins”. 
Indeed, TSC2-deficient cells displayed a rapamycin-sensitive increase 
in phosphorylation of the UPR effector PERK (also known as EIF2AK3) 
(Extended Data Fig. 6a). However, classical chemical inducers of ER 
stress, tunicamycin and thapsigargin, failed to induce NRF1 in wild-type 
cells. Consistent with previous studies”*, treatment of cells with protea- 
some inhibitors led to increased NRF1 protein levels, comparable to those 
seen in TSC2-deficient cells. However, in contrast to TSC2 depletion, 


Figure 2 | mTORCI induces proteasome gene 
expression and protein degradation through 
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RHEB overexpression and growth factor stimulation, the proteasome- 
inhibitor-induced increase in NRF1 levels was not reversed by rapamy- 
cin. Therefore, mTORC1 signalling increases NRF1 levels in a manner 
that is independent of both the UPR and the proteasome recovery 
pathway. Finally, we failed to detect effects of mTORC1 on the nuclear 
and cytosolic distribution of NRF1 (Extended Data Fig. 6b, c). 

The timing of NRF1 induction upon mTORC!1 activation (6 to 12h), 
and the fact that expression of NRF1 from an exogenous promoter led 
to elevated levels of NRF1 and proteasomes that were no longer sensitive 
to rapamycin, suggested that mTORC]1 might regulate NRF1 through 
transcriptional control mechanisms. Indeed, Nrf1 transcript levels were 
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Figure 3 | Growth factors stimulate an increase in NRF1 through mTORC1, 
which induces NRF1 transcription in an SREBP1-dependent manner. 

a, Tsc2./~ MEFs expressing TSC2 were serum starved for 16h and stimulated 
with 10% serum, 10 ng ml | EGF or 100 nM insulin for the indicated durations 
with vehicle or 20nM rapamycin (Rap). b, Nrfl and Nrf2 transcript levels 
from cells that were serum starved for 16 h with vehicle (V), 20 nM rapamycin 
(R) or 250nM of the mTOR inhibitor torin 1 (T) are shown as mean = s.e.m. 
relative to vehicle-treated TSC2-expressing cells (n = 3). Vec, vector. 

*P < (0.001 compared to vehicle-treated TSC2-expressing cells; {P< 0.01 
or ¢¢+P < 0.001 compared to vehicle-treated vector-expressing cells. c, ss RNA- 
transfected cells were treated as in b. Rap, 20nM rapamycin. Nrf1 transcript 
levels are presented as mean + s.e.m. relative to TSC2-expressing cells with 
control siRNAs (siCtrl; n = 3). ***P < 0.001 compared to TSC2-expressing 
cells, {P< 0.001 compared to vector-expressing cells with control siRNAs. 
d, NRF1 protein levels from cells treated as in c. e, NRF1 protein levels in 
HEK293 cells transfected with mature Flag-tagged SREBP1c or empty vector 
and serum starved for 16h with vehicle or 20 nM rapamycin. Phosphorylated 
(p) S6K1 is shown as a marker of mTORCI activity. f, ChIP from HEK293 
cells transfected as in e with anti-Flag (SREBP1c) or Pol II. Bound DNA was 
measured by polymerase chain reaction with quantitative reverse transcription 
(qRT-PCR) for the indicated promoter regions (left; 1, 2, 3) and normalized 
to control IgG immunoprecipitations (IPs). Data are mean + s.e.m. (n = 3). 
b, ¢, Statistical significance for pairwise comparisons was evaluated with a 
two-tailed Student's t-test. 
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elevated and sensitive to mTOR inhibitors in Tsc2-null cells (Fig. 3b). 
Amongst the transcription factors identified to be downstream of mTORC1 
(ref. 9), we found that the sterol regulatory element binding protein 1 
(SREBP1; also known as SREBF1) regulated Nrf1 gene expression. siRNA- 
mediated knockdown of SREBP1, but not SREBP2, decreased Nrf1 tran- 
script levels to a similar extent as rapamycin in both TSC2-deficient 
MEFs (Fig. 3c) and human angiomyolipoma-derived cells (Extended 
Data Fig. 7a), and SREBP 1 depletion also decreased NRF1 protein levels 
(Fig. 3d). Reciprocally, exogenous expression of mature, active SREBP 1c 
upregulated NRF1 and rendered its expression resistant to rapamycin 
(Fig. 3e). Analysis of previous genomic data sets from studies aimed at 
identifying targets of SREBP1, including genome-wide expression’ and 
chromatin immunoprecipitation (ChIP)", further suggested that the 
NRFI gene is directly regulated by SREBP1. Bioinformatic analysis of 
the human and rodent NRF1 loci identified four consensus and con- 
served sterol regulatory elements in proximity with the two predicted 
transcription start sites (Extended Data Fig. 7b). A ChIP assay demon- 
strated that mature SREBP1c bound to the NRF1 promoter, where Pol 
II binding was also enriched (Fig. 3f). As controls, SREBP 1c also bound 
to the promoter of its established target SCD but not to the promoters 
of GAPDH or NRF2 (Extended Data Fig. 7c). As mTORCI signalling 
increases the accumulation of active SREBP1 (ref. 9), these collective 
findings indicate that SREBP1 lies downstream of mTORC1 in the in- 
duction of NRF1 expression. 

We next examined mouse models of both genetic and physiological 
activation of mTORCI signalling in the brain and liver, respectively. A 
brain model of tuberous sclerosis complex involving a conditional hypo- 
morphic allele of Tsc2 (Tsc2°*""3) was used, in which exon 3 is deleted in 
neurons through Cre expression from the synapsin I promoter (Syn1- 
cre)'’. Brain lysates from Tsc2'’*, Tsc2~“"”” (heterozygotes) and 
Tscae e423: Syn 1 -cre (neuron-specific deletion) mice were compared. 
This allelic series showed a graded loss of TSC2 protein with a corres- 
ponding increase in mTORC1 signalling, elevated NRF1 protein and 
mRNA levels, with no change in Nrf2 expression, and a corresponding 
increase in PSM transcript and protein levels (Fig. 4a—c and Extended 
Data Fig. 8a). NRF1 has been shown to control PSM gene expression in 
hepatocytes’®. mTORCI signalling is strongly activated in the liver upon 
feeding’. This stimulation was associated with an increase in the pro- 
tein and mRNA levels of NRF1, but not NRE2, as well as representative 
PSM transcripts, and these were blocked with a single dose of rapamy- 
cin just before feeding (Fig. 4d, e and Extended Data Fig. 8b). These 
data provide in vivo support for NRF1 activation and proteasome in- 
duction downstream of mTORC1 signalling. 

We hypothesized that, in addition to serving as a quality control mech- 
anism for newly translated proteins, the enhanced proteasome activity 
upon mTORC1 activation could serve to maintain adequate pools of 
amino acids to sustain new protein synthesis. Indeed, while inhibition 
of translation with rapamycin or cycloheximide increased amino acids, 
proteasome inhibitors significantly depleted intracellular amino acids 
(Fig. 4f and Extended Data Fig. 9a). Likewise, two distinct siRNAs tar- 
geting NRF! elicited a significant decrease in intracellular amino acids, 
similar to bortezomib treatment (Fig. 4fand Extended Data Fig. 9b). De- 
pletion of intracellular amino acids upon Nrfl knockdown or bortezo- 
mib treatment was also reflected in a decreased rate of protein synthesis, 
which was much more pronounced under conditions of lower exogen- 
ous amino acids (Fig. 4g, h and Extended Data Fig. 9c, d). However, the 
differences in protein synthesis under low and high amino acid con- 
ditions were not reflected in differences in mTORCI signalling (Ex- 
tended Data Fig. 9e). Finally, TSC2-deficient MEFs and MCFI0A cells 
exhibited increased sensitivity to NRF1 knockdown (Extended Data 
Fig. 10a, b), indicating the importance of NRF1 induction for viability 
in the context of mTORCI activation. Collectively, our findings sug- 
gest a model whereby the mTORC1-stimulated expression of NRF1 and 
subsequent increase in cellular proteasome activity serve as a delayed, 
but pre-programmed, adaptive response accompanying increased pro- 
tein synthesis downstream of mTORC1 (Extended Data Fig. 10c). 
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Figure 4 | NRF1 is induced upon mTORC1 
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CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature13676 


Corrigendum: Systems survey of 
endocytosis by multiparametric 
image analysis 

Claudio Collinet, Martin St6ter, Charles R. Bradshaw, 
Nikolay Samusik, Jochen C. Rink, Denise Kenski, 

Bianca Habermann, Frank Buchholz, Robert Henschel, 


Matthias S. Mueller, Wolfgang E. Nagel, Eugenio Fava, 
Yannis Kalaidzidis & Marino Zerial 


Nature 464, 243-249 (2010); doi:10.1038/nature087 79 


Readers alerted us to a technical mistake (a shift of one row) made dur- 
ing the compilation of Supplementary Table 4, resulting in the assign- 
ment of 455 gene symbols to the wrong gene IDs. For the affected 
genes, the cluster group, the number of positive small interfering and 
endoribonuclease-prepared RNAs (si/esiRNAs) per gene and the Phe- 
noscore values were assigned to the wrong gene symbol (although they 
were all assigned to the correct gene ID). All data in the online data- 
base were correct. The Supplementary Information of this Corrigen- 
dum contains the corrected Supplementary Table 4. We have also 
redesigned our online database (containing all images and profiles) 
to provide a more user-friendly interface, a visual representation of the 
phenotypic profiles, a facilitated gene search and integration with other 
public internet databases. The appropriate link on page 245 of the Article 
should therefore be http://endosomics.mpi-cbg.de. 


Supplementary Information is available in the online version of this Corrigendum. 
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CORRECTIONS & AMENDMENTS 


RETRACTION 
doi:10.1038/nature13588 


Retraction: Genomic organization 
of human transcription initiation 
complexes 

Bryan J. Venters & B. Franklin Pugh 


Nature 502, 53-58 (2013); doi:10.1038/nature12535 


We reported the presence of degenerate versions of four well known 
core promoter elements (BRE,, TATA, BREg and INR) at most measured 
TFIIB binding locations found across the human genome. However, it 
was brought to our attention by Matthias Siebert and Johannes Séding 
in the accompanying Brief Communication Arising (Nature 511, El1- 
E12, http://dx.doi.org/10.1038/nature13587; 2014) that the core-promoter- 
element analyses that led to this conclusion were not correctly designed. 
Consequently, the individual core promoter elements were not statisti- 
cally validated, and therefore there is no evidence of specificity for most 
reported core-promoter-element locations. To the best of our knowledge, 
the raw and processed human TFIIB, TBP and Pol II ChIP-exo data are 
valid, but subject to standard false discovery considerations. We there- 
fore retract the paper. We sincerely apologize for adverse consequences 
that may have arisen from the error in our analyses. 
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BRAEDOSTOK /SHUTTERSTOCK 


TECHNOLOGY FEATURE 


WHEN DISEASE STRIKES 
FROM NOWHERE 


When healthy parents have a child with a genetic disorder, the cause is sometimes anew 
mutation. Tools are emerging to meet the challenge of finding such changes. 


Children born with disorders not readily explained by standard tests can sometimes be diagnosed through genome sequencing and analysis. 


BY VIVIEN MARX 


hen parents find that a child is 
not developing as expected, the 
protracted doctor visits, hospi- 


tal stays and examinations only add to their 
distress — especially when no other family 
member has the condition and the standard 
tests on the child’s blood and genes shed no 
light on the cause. The uncertainties, costs 
and anguish can be devastating to families, 
says Michael Friez, who directs the diagnostic 
laboratory at the Greenwood Genetic Center 
in South Carolina, a non-profit organization 
that analyses patients’ genomes for clinicians. 


Every clinical geneticist has experienced 
the inability to identify the cause of a child’s 
neurodevelopmental disorder, adds Roger 
Stevenson, a clinical geneticist also at the 
centre. In the early 2000s, he began seeing a 
family with a toddler that had severe develop- 
mental problems, including a smaller-than- 
average head and intellectual disability. 

It was more than a decade after their first 
visit before sequencing revealed that the boy 
had a mutation in a gene called DYRKIA, 
which is thought to have a role in brain devel- 
opment. The finding later helped to diagnose 
16 other children in the United States and 
Europe who had the same symptoms — and 


although the condition has no cure, Stevenson 
saw that identifying the gene comforted the 
boy’s parents, as did knowing that there were 
other children like their son. 


NEW MUTATIONS 

What was notable about this child’s case was 
that it involved a de novo mutation — one 
that neither parent carries in their regular 
complement of DNA. De novo mutations can 
occur early in the development of the embryo. 
They can bein parents’ gametes. Around 80% 
of de novo mutations seem to occur in the 
father’s sperm and 20% in the mother’s egg, 
says Joris Veltman, a geneticist at Radboud > 
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> University Medical Center in Nijmegen, the 
Netherlands, who in July published a study of 
de novo mutations in people with intellectual 
disabilities’. 

Disorder-causing de novo mutations are 
hard to detect — they have to be identified 
among a host of other, innocuous genetic 
changes. A number of software-based 
approaches are emerging to sift through 
sequenced genomes in search of such 
mutations. 

As sequencing instruments and databases 
of genetic information become increasingly 
available, tool-builders hope that their soft- 
ware contributions can become part of routine 
medical care. But sequencing and analysis are 
different from, say, a blood cholesterol test — 
samples have to be prepared for the instru- 
ments, which churn out the genome sequence 
in snippets that must be assembled and aligned 
to a reference genome, such as that curated by 
the Genome Reference Consortium. 

The results are not 


perfect. A patient’s “Thereisno 
genome sequence single magic 
can contain errors trick that makes 
— caused by the ourmethod 


machine misreading 
a letter of DNA, for 
example — that must be filtered out compu- 
tationally. And even then, a huge number of 
possibilities remains. DNA bases might differ 
from the reference, sequences can be inserted 
or deleted and the number of copies ofa gene 
can vary. Of thousands of such changes, only 
one might have a role ina disorder. 

The child’s DNA is then compared with 
that of the parents. Again, not all differ- 
ences between their genomes connect to the 
child’s disorder. Researchers use software 
that includes statistical analyses to determine 
which changes are most likely to have a role. 
And the tools add information, such as pub- 
lished data about the links between genes and 
disease. These results help to create lists of 
genetic changes, or variants, ranked by likeli- 
hood of being linked to a disorder. But variant 
analysis is still an emerging science, and the 
software tools are still maturing. Despite this, 
in some cases the approach turns up a specific 
genetic change that is likely to be the cause of 
a disorder. 


work well.” 


ASSORTED VARIANTS 
Finding the probable genetic culprit does not 
mean a treatment is available. But such results 
help parents to cope with the situation, says 
Donald Conrad, a geneticist at Washington 
University in St Louis, Missouri. The results 
also inform parents about the risk that the 
condition might recur in their family and help 
them to plan future pregnancies. And some 
prospective parents might opt for genetic 
analysis as part of in vitro fertilization. 

Most newborns carry about 60-100 de novo 
variants, says Conrad — few of which cause 


any discernible problem. Software helps to 
sort these variants out. Conrad has developed 
DeNovoGear, which does statistical analysis to 
distinguish potentially important signals from 
background noise caused by experimental 
error. The software also analyses the nature 
and frequency of sequencing errors. It then 
compares the genomes of parents, children 
and other family members to distinguish true 
de novo mutations from other types of genetic 
variation. 

To improve the odds of finding such muta- 
tions, the analysis takes into account the 
frequency of known variation at a given site 
in the genome. It does so by drawing on data 
from the 1000 Genomes Project, an interna- 
tional research consortium that catalogues 
human genetic variation. “There is no single 
magic trick that makes our method work 
well” Conrad says. “It is just the accumulation 
of many different attempts to squeeze out as 
much information as possible” 


RAISING THE ODDS 

The software must contend with the errors 
made by the sequencing instruments — 
reporting a ‘C’ asa “T’, for example. These mis- 
takes are rare but hard to predict, says Conrad, 
and may explain many false-positive results in 
searches for de novo mutations. High-through- 
put sequencing instruments are more prone 
to error in some DNA regions — which also 
turn out to be where cells are more likely to 
make mistakes when copying and repairing the 
genome. These tricky places account for about 
15% of the genome, Conrad notes, so current 
methods can reliably detect de novo mutations 
only in the other 85%. 

Even the best software tools come up with 
2-3 times as many false positives as true 
positives when analysing whole-genome 
sequence. True positives have to be teased 
out with follow-up experiments — for exam- 
ple, by using the laborious but precise Sanger 
sequencing method to look at the genetic 
region in question. “Each sequencing plat- 
form has its own idiosyncrasies, Conrad says, 
and the optimal method for detecting de novo 
mutations needs to incorporate the machine's 
quirks into its statistical models. 

Conrad is also developing statistical 
methods that take account of the frequency of 
various sequencing errors in different regions 
of the genome. Other software tools typically 
apply the same error estimates at all genomic 
sites. Other researchers are pursuing their own 
approaches. 

For the study published in July’, Veltman 
and his colleagues sequenced and analysed 
the genomes of 50 people with severe intel- 
lectual disabilities (see “Better diagnosis’). 
Working with Complete Genomics (CG) 
in Mountain View, California, a division of 
the genomics giant BGI in Shenzen, China, 
they identified de novo mutations by drawing 
on a number of resources. They used BGI’s 


GENOMICS 


a i 
Martin Reese, chief scientific officer of Omicia. 


technology and software to analyse and com- 
pare genomes and whittle down the number 
of possible disease-causing candidates. They 
did not analyse the data with other tools such 
as DeNovoGear, so Veltman cannot compare 
the methods. But the advantage with BGI’s 
analysis suite is that the software has been 
matched to the specifications of the sequenc- 
ing technology, he says. 

All the patients in the study had previously 
undergone extensive testing. Protein-coding 
regions of their genomes had been analysed, 
and microarrays were used to analyse varia- 
tions in gene-copy number, which can occur 
from person to person and also in some dis- 
orders. Veltman says that the software showed 
high sensitivity in detecting de novo mutations, 
which enabled a more accurate diagnosis of 
almost half of the patients. 


INTERPRET CAREFULLY 

In Veltman’s view, interpreting mutations is 
now more possible for disorders such as intel- 
lectual disability than for diseases such as can- 
cer or diabetes, because many cases of severe 
intellectual disability seem to be caused by a 
single mutation. But, he says, the few hundred 
genes that the scientific community has found 
to be implicated in intellectual disability form 
a still-incomplete list. 

Veltman stresses that the sequencing qual- 
ity in the study was good, but says that even 
the best sequencing technology can miss 
or misidentify de novo mutations. To mini- 
mize errors, researchers need to seek out the 
highest-quality genome sequencing, he says. 
Beyond that, interpreting the many genetic 
variations that turn up when comparing 
genomes — and figuring out which ones are 
related to a disorder — is the field’s major 
bottleneck. Researchers also need to find 
better ways to analyse de novo mutations in the 
genome’s non-coding regions, which are still 
difficult to interpret. 

One suite of tools to analyse protein-coding 
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GENOMICS 


and non-coding genome regions is FastQFor- 
ward, which integrates the software programs 
VAAST’, pVAAST" and Phevor’. These tools 
were co-developed by Mark Yandell, a com- 
putational geneticist at the University of Utah 
in Salt Lake City who directs software develop- 
ment and computational analysis related to the 
Utah Genome Project. That project combines 
family histories from the Utah Population 
Database with medical records, which increas- 
ingly include DNA sequence information. 
The project includes family histories for more 
than 7 million people and medical records for 
around 4 million of them. 

Yandell and his team are using pVAAST to 
analyse family pedigrees in which there is a 
higher frequency of disease. pVAAST searches 
through many genomes in parallel to find 
alterations. The program addresses the sta- 
tistical challenge presented by genomes from 
people who are related, he says. And it detects 
de novo mutations. 

Printed out, the large family pedigrees in 
the Utah database can span almost 2 metres. 
The ones Yandell is studying include multiple 
family members that have mental-health issues 
such as schizophrenia or depression. Mental 
illness has a large environmental component, 
but he hopes that these records can help to 
uncover genetic factors, he says. 

Studying families might offer advantages 
over the more typical ‘cohort analyses’ of 
unrelated people with similar conditions. 
In such cohorts, the causes of mental-health 
problems might be quite diverse. Yandell hopes 
that restricting the search to extended fami- 
lies will make it easier to identify gene variants 
involved. 

VAAST uses a similar approach to that of 
BLAST, a widely used search tool in genetics 
research®. With BLAST, a scientist can take a 
genetic sequence and search through many 
genomes to find high-probability matches to 
it. Similarly, VAAST compares variants in a 
person's genome to those collected in the 1000 
Genomes Project. This comparison helps to 
determine the probability that a variant is caus- 
ing a disease. 

pVAAST extends VAAST’s capabilities to 
family-based sequence data. Yandell also uses 
Phevor, which taps into resources such as the 
Human Phenotype Ontology, which cata- 
logues links between gene function and human 
disease symptoms. 

Phevor helped clinicians to diagnose a 
12-year-old boy who had life-threatening 
diarrhoea and intestinal inflammation. 
Genetic analysis with VAAST had come 
up empty. By combining the analysis with 
Phevor, the researchers traced the boy’s ill- 
ness to a de novo mutation in STAT1, a gene 
involved in many intestinal disorders. The 
finding, which was confirmed with Sanger 
sequencing, enabled the boy’s doctors to 
properly treat and stabilize his condition. 

Yandell hopes that genetic analysis will soon 


bea routine part of clinical diagnosis. Towards 
that aim, he and Martin Reese, a co-developer 
of VAAST, developed Opal, a platform that 
helps clinicians to interpret and use the results 
from software-based genetic analyses. Reese is 
chief scientific officer of Omicia, a company in 
Oakland, California, that offers genetic analy- 
sis using several tools, including Opal, VAAST, 
pVAAST and Phevor. 

Reese says that his company tries to fill the 
gap between tools developed in academia 
and the needs of clinicians. The VAAST algo- 
rithm does the hard-core maths to analyse the 
matches, score their probabilities and create 
a ranking, he says. The Opal software then 
searches for clinical and biological data about 
the candidate genes — added information that 
can help to determine which candidates are 
more likely to be causing the disease. 

In June, Omicia began working with Labora- 
tory Corporation of America, a large medical- 
testing company based in Burlington, North 
Carolina. Omicia will interpret genomic data 
as part of clinical trials. 

Data analysis is Omicia’s specialty. Unlike 
many other companies in the field, it does not 
do sequencing. “We're slicing and dicing the 
genome based on your clinical question,” says 
Reese. His team first assesses the sequence 
quality and filters out typical sequencing errors 
before hunting for changes such as de novo 
mutations. 


FUTURE MEDICINE 

Eventually, clinical standards in this area will 
emerge, but for now service providers use the 
approaches they deem to be best for these 
complex analyses. Reese believes that many 
diseases, if not all of them, have contributions 
from de novo mutations. These contributions 
are hard to identify, he says, but whole-genome 


CASE STUDY 
Better diagnosis 


In a study published in July’, a team of 
researchers in the United States and 

the Netherlands analysed the genomes 
of 50 people who had intellectual 
disabilities that could not be explained 
through standard tests. DNA sequencing 
showed that each person had more 
than 4 million single-base variations and 
more than 250 copy-number variations. 
Computational analysis and comparison 
of the patients’ genomes with those 

of their parents whittled the number 
down to some 60 de novo mutations 

per person, of which 1-2 were in genes 
identified by the software as having a 
connection to intellectual disability. The 
genomic analysis led to a more accurate 
diagnosis for 20 of the 50 patients. 
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analysis raises the probability of finding them, 
as Veltman’s study shows. 

Conrad says that detection of de novo muta- 
tions can bea standard medical test only when 
the genetic complexities of diseases they cause 
are better understood and tool developers have 
found ways to address them, and after the 
technical issues related to high-throughput 
sequencing have been resolved. 

Between 20% and 90% of the de novo muta- 

tions detected by 


“We’re slicing software and with 
and dicing the the help of whole- 
genome based genome sequencing 
onyour clinical can be false positives. 


“Researchers can 
accommodate this 
with extensive follow-up validation experi- 
ments, but this is just simply not practical for a 
routine diagnostic test,’ Conrad says. 

Better approaches are also needed for the 
tough-to-sequence regions of the genome, 
and the software has to cover the spectrum 
of mutations, from single-base changes to 
insertions or deletions. And researchers need. 
to better understand changes such as large 
copy-number variations, regions of repetitive 
sequence and other types of DNA rearrange- 
ments, says Conrad. 

Greenwood Genetic Center, which uses 
genetic analysis to diagnose patients, does its 
own analysis and uses commercial services. 
Scientists and companies doing genetic analy- 
sis will soon have access to many of the same 
shared resources, and Friez says that he looks 
forward to seeing how that will help patients 
with neurodevelopmental disabilities. For 
now, patients, their families and clinicians 
all face the same issue: researchers’ ability to 
identify mutations associated with disorders 
is not always matched by a medical under- 
standing of these mutations, and therapies 
that might arise from knowing about them 
are far in the future. 

But genetics does deliver some answers 
for these patients and families, says Velt- 
man. “From what I hear from my clinical 
colleagues, these families are very happy to 
finally get an answer — it often means clo- 
sure for them, they can give the disorder in 
their child a place and better accept it,” he 
says. “In regards to therapy and treatment, 
unfortunately options are still quite limited, 
but progress is being made.” m 


question.” 


Vivien Marx is technology editor for Nature 
and Nature Methods. 
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EARLY-CAREER FUNDING 


Big introductions 


Europe’s Starting Grants are ideal for young researchers 
with big ideas and what it takes to bring them to life. 


BY QUIRIN SCHIERMEIER 


n a mid-April afternoon in 2012, 
() Francesco Ricci boarded a plane in 

Rome ina highly anxious state — but 
not because he was afraid of flying. He knew 
that the ten-minute presentation he was giv- 
ing the next day in Brussels could propel his 
career into orbit. All he had to do was to per- 
suade a panel from the European Research 


Council (ERC) to fund his application for a 
Starting Grant. 

A postdoc in the chemistry department of 
the University of Rome Tor Vergata, Ricci had 
spent the greater part of two months working 
on his proposal to develop DNA-based nano- 
devices for diagnosis and treatment of cancer. 
After he was invited to Brussels, he prepared 
his talk and rehearsed it time and again in 
front of colleagues, encouraging them to ask 


the trickiest questions they could think of. He 
knew that his chances of success at winning a 
grant hinged on him giving an effective pres- 
entation. 

In the hall the next day, where a dozen 
hopeful applicants waited to be called, a tense 
silence prevailed. “You try to figure out if the 
others are better scientists than you and what 
their projects might be about,” Ricci says. 
“You know that it’s possible that fewer than 
half of the people in the room will get a grant.” 

The chance to win a €1.5-million (US$1.9- 
million) Starting Grant is a thrilling prospect 
for young scientists — and for a postdoc in 
Italy, where funding and career opportunities 
for young scientists are notoriously poor, it 
can be the chance of a lifetime. An ERC grant 
— whether a Starting Grant or a Consolida- 
tor Grant for researchers at more advanced 
stages — means a good couple of years dur- 
ing which winners do not need to worry too 
much about funding. That is a rare luxury in 
today’s research environment, and one that 
can empower research output and scientific 
reputation. 

Launched in 2007 as the European 
equivalent of the US National Science Foun- 
dation, the ERC has quickly turned into 
the flagship funding programmes of the 
European Union (EU). More than €13 bil- 
lion is earmarked for the agency in the 
EU’s €80-billion Horizon 2020 research 
programme — the region’s biggest science- 
funding and innovation programme ever, 
launched in January. The Starting Grant pro- 
gramme, aimed at promising EU-based early- 
career researchers of any nationality, appeals 
to young scientists because the application 
rules are straightforward and it favours basic 
science. But it is highly selective, and success 
demands time, effort, rigorous preparation — 
and, perhaps, a bit of luck. 

More than 4,500 ERC grants — including 
2,330 Starting Grants — have been awarded 
since 2007. Typically, the ERC invites two to 
three times as many applicants as will receive 
funding to give a presentation. The success 
rate rose from around 3% in the first call to 
about 15% in 2010, but fell back down to just 
9% last year, owing to a sharp increase in the 
number of applications. 

This year, almost 3,300 scientists applied, 
and around 330 will be selected for fund- 
ing. The next call opens on 7 October and 
will close on 3 February 2015 (deadlines are 
strictly enforced). The €430 million reserved 
for this call — about one-quarter of > 
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> the ERC’s budget for the year — will be 
distributed among roughly 330 people. The 
amount to be disbursed is slightly larger than 
last December’s initial estimate of €411 mil- 
lion because it expects an equally high number 
of applications next year and it aims to keep 
the success rate consistent from year to year. 

Ricci’s 10-minute presentation went well — 
and so, he felt, did the subsequent question- 
and-answer session with panel members. Yet 
he did not make the cut: an e-mail in July 
told him that his project had not been rec- 
ommended for funding. 

He was disappointed but did not give up. 
The reviewers comments were not altogether 
discouraging, so the following year he sub- 
mitted a slightly altered proposal that empha- 
sized the medical potential of his research. 
That May, he returned to Brussels. He was 
even more nervous than in the previous year: 
it was his last chance, because applicants can 
reapply only once. This time, he felt that the 
presentation and subsequent question-and- 
answer period went less smoothly, but his 
persistence was rewarded: after an anxious 
two-month wait, he learned that his project 
had been funded. 


OPEN TO ALL 
The ERC supports researchers, irrespective 
of nationality, age and gender, who will be 
employed at, or affiliated with, host insti- 
tutions in the EU or associated countries 
(including Iceland, Israel, Norway and Tur- 
key) to conduct their research. The Starting 
Grant is one of three that it offers; the others 
are consolidator grants for researchers a bit 
further along their career path and advanced 
grants for eminent senior scientists. Deci- 
sions are based on the scientific merits of the 
applicant and the ambition and feasibility of 
the project. 

Applicants for starting grants must have 


completed their PhD or equivalent 2-7 years 
before the call (time taken off for maternity 
or paternity leave, clinical training, long-term 
illness or national service are not part of the 
count) and must have published at least one 
paper without co-authorship of their PhD 
supervisor. There is no co-financing required 
from host institutions, and award recipients 
do not have to team up with other groups or 
companies, as in most other EU research pro- 
grammes. There are also no thematic priori- 
ties: scientists in any field, including the social 
sciences and humanities, are equally eligible 
for funding. 

The foremost criterion, says José 
Labastida, the ERC’s science director, is a 
unique research idea that has the potential 
to substantially advance the knowledge in 
a given field. Reviewers also want to know 
that a candidate has what it takes to get the 
work done. “Apply for an ERC grant only if 
you have a really new and ambitious idea 
and you can demonstrate that you have the 
potential to pursue it with a team of your 
own,’ Labastida says. “A proposal that smells 
of incremental research, or just more of the 
same stuff that you have been doing before, 
is not going to fly.” 

It can be challenging for scientists in the 
early stages of their career to start thinking 
that big. Taking on full ownership of a project 
— for many, a completely new experience — 
is no easy task. 

Candidates should also make sure that 
their CV and summary — submitted along 
with a full project description — meet the 
ERC criteria (see “The nuts and bolts of your 
application’). During the first judging stage, 
reviewers will look only at these components. 
“The majority of reviewers on the panel will 
not be experts in your specific line of research 
— so send the key message to a wide audience 
and leave the details to the full proposal,” says 


THE RIGHT CV 


The nuts and bolts of your application 


To apply for a European Research Council 
Starting Grant, you must follow specific 
guidelines. Here is a summary of what you 
need to do — and not do. 

@ Your CV should be written in English and 
not exceed two pages. 

@ It should give a complete account of 
your academic record, including the years 
of your master’s (if applicable) and PhD 
programme and the name of the university 
(or universities) and department (or 
departments) that awarded them. 

@ It should also include your full name and 
date of birth, and ought to include the URL 
for a current personal website, although this 
is not obligatory. 


@ It should document previous and current 
academic positions, fellowships and 
awards, teaching activities, institutional 
responsibilities, memberships in scientific 
societies and any major collaborations. 
@ It should clearly explain any educational 
or employment gaps, or unconventional 
career paths. 
@ All ongoing and submitted grants and 
funding can be detailed on a separate page. 
@ It should list only five representative 
publications. Any other relevant papers 
should be listed in the early-achievements 
track-record section of the application. 

A model CV and application forms are 
available at go.nature.com/hjn9vwy. 0.8. 
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Chief scientists Veerle Huvenne (right) and 
Aggeliki Georgiopoulou plan their next expedition. 


Labastida. The summary should outline the 
idea and explain why the research is impor- 
tant and scientifically feasible in no more than 
five pages. When possible, applicants should 
provide preliminary data. Having friends 
and scientists who are unfamiliar with the 
research read it and note what they find hard 
to understand will help applicants to make the 
message clear, he says. 

The CV should highlight relevant skills 
and scientific achievements. It should list any 
stints at high-profile universities outside the 
applicant’s home country, and previous grants 
or participation in EU-funded research and 
mobility programmes may help to show com- 
mitment and scientific potential. 


THE RIGHT TIME 
Marine geoscientist Veerle Huvenne of the 
National Oceanography Centre in Southamp- 
ton, UK, had been the chief scientist in several 
international research cruises before decid- 
ing in 2009 that the time was ripe to apply 
for a Starting Grant. She had already gath- 
ered experience in managing a collaborative 
marine-research project during a Marie Curie 
fellowship from 2005 to 2007. She wanted to 
use the ERC money to map coral habitats and 
study the biodiversity they support. 
Huvenne says that advice from team 
members and former collaborators was 
extremely helpful in securing the €1.4-million 
grant. “Take your time to develop a really 
strong idea, let it mature and discuss it inten- 
sively with colleagues and peers before you 
start drafting a proposal,’ she advises. “Make 
sure the synopsis is rock solid. If you're invited 
to an interview, prepare rigorously. It’s a major 
effort, but there is a lot to gain for your whole 
career. 


DIMITRA SALMANIDOU 


FLORIAN RAIBLE 


Kristin Tessmar-Raible (right) and PhD student Juliane Zantke work on how lunar cycles affect the 
activity of worms. 


Compelling and intelligible writing make 
a difference when reviewers flooded with 
applications are trying to make their choice 
— and honest feedback from colleagues can 
really help, says Kristin Tessmar-Raible, a 
neurobiologist at the University of Vienna 
in Austria. Tessmar-Raible was awarded 
a Starting Grant 
in 2013 for inves- 
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ing of the headline 
and sentences and 
the order of paragraphs and tables can be 
vastly improved by input from friends, col- 
leagues and professional proofreaders, who 
can all offer tips on how to make a synopsis 
catchy and a full proposal concise and well- 
structured, she says. In addition, national 
ERC help desks in all EU countries offer 
grant-writing assistance, and some also 
offer interview-training courses. 

Less is often more. “Avoid unnecessary 
information in a CV that might only con- 
ceal the things that really matter,” says 
Ricci. Likewise, he says, presentations that 
are padded with data and technical details 
tend to be more confusing than informative. 
“You have only ten minutes to describe your 
grand vision and ambition, says Tessmar- 
Raible. “Every second counts, so you have 
to think hard about every word and every 
slide that you use.” 

Applicants should also carefully consider 
which of the ERC’s 25 panels they would like 
to evaluate their proposal. “If you end up 


being reviewed by the ‘wrong’ panel it might 
diminish your chance of getting funded,’ 
says Erik Garnett, a physical chemist at the 
Institute for Atomic and Molecular Physics 
in Amsterdam. Garnett had moved there in 
2012 from Stanford University in California 
with little knowledge of the funding situa- 
tion in Europe. On the advice of the director 
of his new institute, he applied for a Start- 
ing Grant to develop nanomaterials that can 
be used to make high-efficiency solar cells 
— his first grant proposal ever — and suc- 
ceeded. Before he submitted his application, 
he looked up the CVs of panel members of 
previous calls to get a feel for whether their 
expertise overlapped with his research. He 
opted for the Material and Synthesis panel 
because its members seemed to have more 
affinity for his work than did others. 

Starting grants could well galvanize 
researchers’ careers, says Huvenne. She 
herself is a good example: last summer, 
two years after her ERC-funded project 
launched, she was promoted from senior 
research fellow to team leader for the sea- 
floor and habitat mapping group at her 
institution. She was surprised by how many 
scientists have since got seriously interested 
in her research. 

And in Italy, universities can hire 
Starting-Grant winners without following 
the country’s historically twisted routes to 
academic appointment. Winning an ERC 
grant provided Ricci a springboard to a 
permanent position. Twelve months after 
his second return from Brussels, he was 
promoted to associate professor. m 


Quirin Schiermeier is a senior reporter 
with Nature in Munich, Germany. 
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CAREER PROGRESSION 
Europe on track 


Tenure is gaining traction in Europe 

even as the system is slipping away in 

the United States, according to a study 

by the League of European Research 
Universities in Leuven, Belgium. The 
study surveyed 21 universities throughout 
Belgium, Finland, France, Germany, Italy, 
the Netherlands, Sweden, Switzerland, 
Spain and the United Kingdom. It found 
that seven nations are now using tenure 

as a way to recruit internationally and to 
offer researchers a clearer career path. The 
paper defines tenure-track as a fixed-term 
contract that can lead to a permanent 
position. Institutions surveyed in the 
United Kingdom, France and Spain do 
not have tenure systems. Meanwhile, the 
proportion of tenure-track positions in 
the United States has declined in the past 
30 years, notes the study. 


EDUCATION 
Degrees of difference 


Fewer than one-quarter of people aged 
25-64 in the 34 member nations of the 
Organisation for Economic Co-operation 
and Development (OECD) earned a 
degree in 2012, finds an OECD report. 
Education at a Glance 2014: OECD 
Indicators examined education attained 
by adults in Europe, North America, 
South America and Asia. The report 
found that at least one-third of adults 
aged 25-64 in the United States, Norway 
and Israel had earned a degree. Chile 
and Austria had the lowest rates at 12% 
and 13%, respectively. Other nations 

fell between these rates. The average 

age for completing a doctoral research 
programme across the member nations 
was 35. Korea reported the oldest age of 
40; Germany the youngest at 31. 


RESEARCH AND DEVELOPMENT 
Falls in funding 


US federal spending on scientific 
research and development is projected to 
have fallen by 4% from 2011 by the end of 
this year, according to a report from the 
US National Science Foundation (NSF) 
in Arlington, Virginia. The report, which 
collected data from the 27 US science- 
funding agencies, shows that spending 
reached US$140 billion in 2011 and is 
expected to slip to $134 billion this year. 
The 2014 total is likely to be even lower, 
says an NSF spokesperson, because it 
does not account for a 2013 across-the- 
board cut to discretionary spending. 
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Ua SCIENCE FICTION 


BY REBECCA ROLAND 


wen hated this part of the job: get- 
(GS a memory extraction from a 
fresh witness. 

The woman sat on the other side of the 
table inside the tent the Chicago PD had hast- 
ily erected near the shooting in the park. She 
was silent, stunned, covered in blood spat- 
ter. A cup of coffee sat untouched in front 
of her, its smell only somewhat covering the 
tang of blood, the sour stench of terror. Gwen 
glanced at the paperwork. Meghan Johnson. 
Young. Pretty. Meghan’s hands curled around 
the cup. She wore a plain wedding band. 

The extractor sat on the table, the steady 
green light indicating it was primed and 
ready. It looked like a mesh helmet equipped 
with a tiny pack that would nestle over the 
forehead. 

Gwen cleared her throat. Mrs Johnson? 
No, no need to remind her of the man who 
should’ve been sitting next to her, and too 
formal besides. “Meghan?” she said softly. 

The woman could have been a statue, she 
sat so still. 

“Meghan,” Gwen said louder. 

The woman twitched, looked up. She 
swayed in her seat. “Yeah.” Flat. 

“It’s time.” Gwen lifted the extractor and 
carried her chair around the table so that she 
sat beside Meghan. 

The other woman leaned back, eyes wid- 
ening. “N-now?” 

“Tm afraid so. The quicker I retrieve your 
memory of the crime, the better the qual- 
ity. It will make all the difference when it 
comes time for the trial.” She laid a hand on 
Meghan’. The other woman was frigid. “And 
you won't have to relive this again. I retrieve 
the memory, and then I erase it.” The erasure 
was the only decent part of this whole pro- 
cess. Who wanted to recall being brutalized, 
or watching someone hurt a child, or seeing 
a loved one die? 

“Does it make it easier?” Meghan asked. 
“Erasing it?” 

“Tt helps” 

“You speak from experience?” 

Gwen hesitated. She'd gone through it 
once, as every extractor did: to know what 
it was like, to better sympathize with vic- 
tims, shed erased a memory of her getting 

a speeding ticket. 
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EXTRACTION 


Total recall. 


most conversations with her mother, who 
felt the overwhelming urge to criticize every- 
one. She understood that those things had 
happened, but she didn’t have to carry them 
around like she used to, and when she spoke 
with her mother, or had a bad day at work, 


she consoled herself with the fact that she 


could just make it all disappear. Using the 
extractor left her with the same feeling as a 
couple of glasses of wine: pleasantly sleepy 
and numb, with all the bad feelings shoved 
far down where she didn't have to deal with 
them. “Yes.” 

Meghan’s eyes shimmered behind unshed 
tears. “Do it” 

Gwen had her sign the paperwork, then 
fitted the extractor over Meghan’s head. She 
pulled out her tablet and keyed in the instruc- 
tions. She watched every extraction to ensure 
it was a useful memory for prosecuting the 
bastards later. Then she usually erased them 
from her memory. She didn’t want her own 
baggage, much less anybody else’s. 

In Meghan’s memory, she sat beside a hand- 
some Latino man on a blanket in the park, 
only a dozen feet from the stage where a band 
played jazz. The details were sharp, from 
the quality of the sound to the way Meghan 
recalled the two women just in front of them 
swaying slightly with the music, to the lights 
glowing yellow in the distance along the path. 
She was a great witness, and she had the per- 
fect view of the gunman as he approached 
and opened fire. Gwen winced. No wonder 
the poor woman was a mess. 

Then came the moment when Meghan 
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hovered over her husband. Blood burbled 
from his mouth and covered a wide swathe 
of his white dress shirt. 

He reached for Meghan. She grasped his 
hands. “Love... you,” he gasped. 

“T love you,” Meghan sobbed, over and 
over, until his eyes turned glassy. 

The extraction stopped. Gwen was 
amazed at how calm her hands remained as 
she saved the images and sounds. 

Meghan shuddered violently as if chilled 
to her bones. 

Gwen's finger hesitated over a key. “Are 
you ready to remove the memory?” 

She hesitated. 

“You don’t have to see him shot ever 
again. Not in your memories, not in your 
nightmares.” 

Her hands closed, opened, closed in her 
lap. “You remove everything?” 

“Everything” 

“Can you leave his last words to me?” 

“T can't. ’'m sorry.” 

‘Tl keep it” 

“What?” Gwen nearly hit the button out 
of surprise. 

“Tl keep it,” she whispered. “Otherwise 
his last words will be about our stupid mort- 
gage. ll keep it” 

“Theres no need to live with that memory.’ 

Meghan met her gaze. Her green eyes were 
red-rimmed and determined. “It’s worth 
some suffering to know that he spent his last 
breaths letting me know that he loved me.’ 

Gwen's hand shook as she shut off the pro- 
gram and removed the extractor. Nobody 
had ever refused an erasure before. “I'll show 
you to acounsellor, if youd like.” 

“T would.” 

Gwen led Meghan out of the tent. She 
held her chin high, hands steady, the com- 
plete opposite of the wreck she'd been when 
shed entered. Gwen passed her to a waiting 
counsellor, then turned the extractor over in 
her hands. This was normally the time shed 
erase her own memories of the scene, of the 
witness's testimony. So I can stay sane, she 
always told herself. Nothing good ever came 
from bad moments. 

Except, a moment of love could sneak in. 
Maybe shed been erasing good along with 
bad all this time. She went back into the tent 
and packed the extractor carefully away. = 


Rebecca Roland is the author of Shards of 
History and the short-story collection The 
King of Ash and Bones, and Other Stories. 
Find out more at rebeccaroland.net. 
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